Gene Francci3_4312 details

Gene Information

Locus tag: Francci3_4312
Symbol:
ID: 3907281
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 5151143
End bp: 5152264
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 72%
IMG OID: 637881640
Product: hypothetical protein
Protein accession: YP_483387
Protein GI: 86742987
COG category: [S] Function unknown
COG ID: [COG5282] Uncharacterized conserved protein
TIGRFAM ID: [TIGR03624] putative hydrolase
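
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Expected protein length in aa for an unspliced CDS.

    Assumes the gene length includes one terminal stop codon,
    which is not counted as an amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(5151143, 5152264)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the values agree with the Gene Length and Protein Length fields of this record.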


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.658784
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.614415
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGACGCCG AGCTGGTCGA CTGGGAACTC GCCGTGACGA CGGCGAGGAA ACTGGTTCGC 
CCCGGCCCGC AGATCTCGCG GGAGCAGGCC GGCGAGGTCG TCTCGGAGCT GCGGCGACTG
GCCGTCGAGG CCGAGCGGCA CGTCGAGGAC TACACCCGGC TCACTCCCGC CGGGCCGGCC
ACCCCCATCA CCGTGGTGGA CCGGCTGGAG TGGGTCCGCT CCAACGTCGC CGGTCTGCGG
GTGCTGACCT CCCCGTTGCT GGGCAAACTG TCCGAACAGC AGCGCGGCAA CCTCGCCGCG
GCGGTCGGTC GGCGGGTCAC CGGGGTCCAG ATCGGCACAG CCTTGGCCTA TCTGGCCGGC
AAGGTGCTGG GCCAGTACGA GGTCTTCCTC CCGCCGGAGG AGTACCAGGC CGGGAGGGAC
GATCCGGGTC CGCCCGACGT GCCGGGGTCG GTCGGGCGCC TCAGCCTGGT CGCGCCGAAC
ATCGCGCACG CCGAGGCGGC CATGGGGGTC GTGCCGCGAG ACTTCCGGAT GTGGGTATGC
CTGCACGAGC AGACCCATCG CAGTCAGTTC ACCGCCGTGC CGTGGCTGCG GACCCACCTG
GAGTCCGAGA TCACCGCGTT CATTGAGGCC ACGGACCTCG ATCCGGACGT CCTCGCGGAC
CGCGTCCGCT CGGCGATGAG CGCGCTGCGC GGGGCGGTGT TCGACCGGGA GGCCGACAGC
CCGAGCGTCG TGGAGGCGCT GCAGACCCCG GCCCAGCGAG CGGTTCTCGA CCGCCTCCAG
GCCATCATGA CGTTGTTGGA GGGCCACGCC GACCAGGTGA TGGACGCGGT GGGGCCGAAG
ATCGTACCCA CCGTCGCCGA CATCCGGACG AAGTTCGAGG GTCGCCGCGG CGGGGGATCG
CCGATCGACC GGTTCGTCCG CCGGCTCCTC GGTCTCGATC TCAAGCTGCA GCAGTACCGC
CAGGGCGGGG CGTTCGTGCG TGCGGTGGTC GCGCAGGCCG GGGTGACGGG TTTCAACACC
GTCTGGGAGT CACCGGCGGC GCTGCCGACC CGCGCCGAGC TGGTCGATCC CACCGCCTGG
ATGCAACGGG TCCTGGGCAG CCGCCCGCCG ATCTCGGCGT GA
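
The GC content field of this record can be recomputed from the gene sequence as listed. This is a minimal sketch, assuming the sequence is pasted in the 10-base block layout used here (whitespace between blocks) and contains only A, C, G and T; `gc_fraction` is a hypothetical helper name.

```python
def gc_fraction(seq):
    """Fraction of G and C bases in a DNA string, ignoring whitespace."""
    dna = "".join(seq.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First 10-base block of the gene sequence, used only as a small example.
print(f"{gc_fraction('GTGGACGCCG'):.0%}")
```

Passing the full gene sequence to the same function gives a value that can be compared with the reported GC content.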
 
Protein sequence
MDAELVDWEL AVTTARKLVR PGPQISREQA GEVVSELRRL AVEAERHVED YTRLTPAGPA 
TPITVVDRLE WVRSNVAGLR VLTSPLLGKL SEQQRGNLAA AVGRRVTGVQ IGTALAYLAG
KVLGQYEVFL PPEEYQAGRD DPGPPDVPGS VGRLSLVAPN IAHAEAAMGV VPRDFRMWVC
LHEQTHRSQF TAVPWLRTHL ESEITAFIEA TDLDPDVLAD RVRSAMSALR GAVFDREADS
PSVVEALQTP AQRAVLDRLQ AIMTLLEGHA DQVMDAVGPK IVPTVADIRT KFEGRRGGGS
PIDRFVRRLL GLDLKLQQYR QGGAFVRAVV AQAGVTGFNT VWESPAALPT RAELVDPTAW
MQRVLGSRPP ISA
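
The protein sequence corresponds to the gene sequence read codon by codon with translation table 11, where a GTG start codon is read as M. The sketch below illustrates this under stated assumptions: the codon table is the standard one (which table 11 shares for elongation), only ATG, GTG and TTG are treated as start codons although table 11 permits a few more, and `translate_cds` is a hypothetical helper name.

```python
BASES = "TCAG"
# Standard codon table in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Simplifying assumption: only the most common bacterial start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq, is_cds_start=True):
    """Translate a DNA string (whitespace ignored) up to the first stop codon.

    If is_cds_start is true and the first codon is a recognised start codon,
    it is read as M regardless of its usual meaning.
    Raises KeyError on codons with characters other than A, C, G, T.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGGACGCCG AGCTGGTCGA CTGGGAACTC"))
```

For the first 30 bases this yields MDAELVDWEL, matching the first block of the protein sequence.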