Gene Francci3_0809 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0809 
Symbol 
ID3906436 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp940356 
End bp941789 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content67% 
IMG OID637878142 
Producthypothetical protein 
Protein accessionYP_479922 
Protein GI86739522 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACACG CACCATCACA CGAAAGGATC ATCAACCAGG GAGACGTCCC ACCCGCCGAG 
CAGTCGCACG ACGAGCGCCT GATCAACCGA GGCATCGAGG CCGCCGCCCA GGAGGAGCGG
CCCATCGACG ACCGCACCGC GCGCTACATC GCCGGACAGC TGCACGGTGG TCAGGTCAGC
GCCCTCTACA GCCTGGCCAG CACCGGCAAC ATTATCGAGG ACACCGTCTA CCACGAGCTG
TACGAAGACC TCGAATCCCA GACGCCGGAA GTCGCCTCGT GGGTCGAAGC GCTGCGAACC
TACTGCCAGG CCAGGCCCGA CAAGGGTCCG GTGAGCGGCT GGGCCGAGCA CGCCGCTCTC
CTTGACCGCA TCGAGGCCAC GCGGGAGCGC ACGCGGATGC TCGGCGGGAT CGCCGTCGCC
CCCGAACTGG CCGAAGCCGA CGAGACGTCC AGGAAAGATC ATTTCCTGCA TCCAGACCGA
CTGAACGAAC TGTTCGGCGA ACCGCCGGAC GAGGAGATCG GCCGCGCCGA GGAGCTGGGC
TGGTTCGGCC TGATCGTCGA CCACGGCACC GGCGGCGGCA CGATCATCTC CCAGGACGAG
CAGGGCTTTC GGTACGTCTG GGAGACGGAG GACGGTGAAG CCCTCGACCA GCGGTGGCAG
GCCATCCTCC GGGAGTACCG ACGCTACGAA GACGCTCTTG TCCAGCTCGA ACGACACGAG
CAGGACGACC GGTGCGAACG GGTCGGCTAC GCCTGCCCGG AGTGCGAAGA GCAGATCATC
GAACACTCGG TCGGACTTGA CGAGTCCACC TGGACGCACC AGGACGGCGG ACCGCTGTGC
CCCGTCGTCG GAGACGGCGG CTACCAGCCG GCGCAACCAG GCCTCTGGCG GGACGGCGAG
ATCGTGCCCC TGGCTGAGCA AGCTGACGGG GACGACGCCC ATGGTGACGG ACCCCGCGTC
TACGTGGCCA GCCTCGCCGA CTACACCAAC GGCGAGCTAC ACGGCCGCTG GATCGCCGCC
GACCATGATG TCGAAGACCT GGAGGGCGCC GTTGCCCGCA TCCTGGCGAC CTCACCAGCC
CGGCGGCACG GCGAGGCGGC CGAGGAATGG GCCATCCACG ACTACGAGGG CTTCGATGAG
GAGGTCACGT CCACGCTGGG CGAGGGCCGT CGCTACGACC GTTTGACCCC GCCACCGTTC
AATCTTGGTG CAGGGTCCCG AAATCACTGT CCGCCCAGTC GCCAAGCGGC TCCTGACCAG
GGACGATGCC CAGTGAGTCT CCGACCGACA ACATCATCCA CCGAGCCGAA CATGTGTTCC
GTGGCTCTGG TGGGTCCGGA CACCGCGGTC CGTCTGGTGA CCGCGATCAT GGGCACGGTC
GAGCTGGAAT CCGGATGGGG ACCGGTCGTC ACTCAGCCGC CGGTGATTTC CTGA
 
Protein sequence
MEHAPSHERI INQGDVPPAE QSHDERLINR GIEAAAQEER PIDDRTARYI AGQLHGGQVS 
ALYSLASTGN IIEDTVYHEL YEDLESQTPE VASWVEALRT YCQARPDKGP VSGWAEHAAL
LDRIEATRER TRMLGGIAVA PELAEADETS RKDHFLHPDR LNELFGEPPD EEIGRAEELG
WFGLIVDHGT GGGTIISQDE QGFRYVWETE DGEALDQRWQ AILREYRRYE DALVQLERHE
QDDRCERVGY ACPECEEQII EHSVGLDEST WTHQDGGPLC PVVGDGGYQP AQPGLWRDGE
IVPLAEQADG DDAHGDGPRV YVASLADYTN GELHGRWIAA DHDVEDLEGA VARILATSPA
RRHGEAAEEW AIHDYEGFDE EVTSTLGEGR RYDRLTPPPF NLGAGSRNHC PPSRQAAPDQ
GRCPVSLRPT TSSTEPNMCS VALVGPDTAV RLVTAIMGTV ELESGWGPVV TQPPVIS