Gene Franean1_5057 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5057 
SymbolengA 
ID5673393 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6055362 
End bp6056750 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content75% 
IMG OID641243908 
ProductGTP-binding protein EngA 
Protein accessionYP_001509323 
Protein GI158316815 
COG category[R] General function prediction only 
COG ID[COG1160] Predicted GTPases 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR03594] ribosome-associated GTPase EngA 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0472211 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACACCG AAGATCTCGC CGCGTCCGTC GACGACGGCC CTGTGGGCGG TGCCCTCCCG 
CTGGCGGGAG GGCAGCCCGT CCTCGCCGTC GTCGGCCGGC CGAACGTGGG CAAGTCGACG
CTGGTCAACC GCATCCTGGG CCGCCGCGCG GCCGTCGTCG AGGACGTCCC CGGCGTCACC
CGTGACCGGG TCGCCTACGA CGCGGTGTGG AACGGCCGCC GGTTCACCCT CGTCGACACC
GGCGGCTGGG AGCCCGACGC CCGCGGCCTC GCCGCCCGGG TCTCCGACCA GGCCCGCGCC
GCGCTCGACA CCGCCGACGG CGTGCTGTTC GTGATCGACG CCACCGTCGG CGCGACCGAC
GCCGACGAGG CCGTCGCCCG GGTGCTGCAC CGGTCGGGCC GGCCGGTGAT CCTCGCCGCG
AACAAGGTCG ACGACGCCCG CGCGGAGGCC GACGCCGCCG CGCTGTGGAG CCTCGGGCTG
GGCGAGCCGT ACCCGGTGTC CGCGCTGCAC GGCCGGGGCA GCGGCGACCT GCTGGACGCC
GTCCTCGCGG TGCTGCCCGA GGCACCCCGC GAGCGGTTCA CCGAGGAGGA CGGCCCCCGG
CGCGTGGCGC TGATCGGGCG GCCGAACGTC GGCAAGTCCA GCCTGCTCAA CAAACTGGCC
GGCAGCGAGC GCTCGCTGGT GCACGACGTC GCGGGCACGA CCCGCGACCC GGTGGACGAG
CTCGTCACCG TCGGCGGCGA GACCTGGATG TTCATCGACA CCGCCGGCCT GCGGCGGCGG
GTGAAGGAGG CCTCCGGCGC CGAGTACTAC TCGTCGCTGC GCACCGCCTC CGCGCTGGAG
GCCGCCGAGG TCGCGATCGT CCTGCTCGCC GCGGACGAGC CGGTCACCGA GCAGGACCAG
CGGATCATCA GCATGGTCAC CGACGCCGGC CGGGCCCTCG TCCTCGCCTT CAACAAGTGG
GACACGCTCG ACACCGAGCG CCGTCTCGAC CTGGAGCAGG AGATCGTCCG CGAGCTGGGC
CGGGTGGCCT GGGCGCCGCG GGTGAACATC TCGGCCCGCA CCGGCCGCGC CACCGACCGG
CTCGCCCCGG CGCTGCGGAC GTCCCTCGAC TCGTGGGGAA CGCGCATCCC GACCGGCCGC
CTCAACGCCT GGATCGGAGA GGTCGTGGCG GCCACGCCGC CGCCGTCGCG GGGCGGGAAG
CTGCCGCGGG TGCTGTTCGC GACCCAGGCC GGGGTGCGCC CGCCGCGCTT CGTCGTGTTC
ACCACCGGAT TCCTCGAGCC GGCCTACCGG CGTTTCCTGG AGCGCAAACT GCGCGAGGAC
TTCGGCTTCG CCGGCACGCC CATCGAGATC TCGATCCGGG TCCGCGAGCG TCCCGACCGC
CACCGCTAG
 
Protein sequence
MNTEDLAASV DDGPVGGALP LAGGQPVLAV VGRPNVGKST LVNRILGRRA AVVEDVPGVT 
RDRVAYDAVW NGRRFTLVDT GGWEPDARGL AARVSDQARA ALDTADGVLF VIDATVGATD
ADEAVARVLH RSGRPVILAA NKVDDARAEA DAAALWSLGL GEPYPVSALH GRGSGDLLDA
VLAVLPEAPR ERFTEEDGPR RVALIGRPNV GKSSLLNKLA GSERSLVHDV AGTTRDPVDE
LVTVGGETWM FIDTAGLRRR VKEASGAEYY SSLRTASALE AAEVAIVLLA ADEPVTEQDQ
RIISMVTDAG RALVLAFNKW DTLDTERRLD LEQEIVRELG RVAWAPRVNI SARTGRATDR
LAPALRTSLD SWGTRIPTGR LNAWIGEVVA ATPPPSRGGK LPRVLFATQA GVRPPRFVVF
TTGFLEPAYR RFLERKLRED FGFAGTPIEI SIRVRERPDR HR