Gene Francci3_3871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3871 
Symbol 
ID3906639 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4633497 
End bp4634573 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content69% 
IMG OID637881197 
ProductGTP-dependent nucleic acid-binding protein EngD 
Protein accessionYP_482950 
Protein GI86742550 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0012] Predicted GTPase, probable translation factor 
TIGRFAM ID[TIGR00092] GTP-binding protein YchF 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.479028 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCTTGT CGATCGGGAT CGTCGGGCTG CCCAACGTCG GCAAGTCCAC GTTGTTCAAC 
GCCCTGACGC GCAATGAGGT GCTGGCCGCG AACTACCCGT TCGCGACGAT CGAGCCGAAC
GTGGGCGTGG TCGGGGTGCC CGATCCGCGT CTCGGCGAGC TCGCGAAGCT CTACGACAGC
GCCCGGACGG TGCCGGCCAC GGTCAGCTTC GTGGACATCG CCGGTCTGGT CCGAGGCGCG
TCCGAGGGGC AGGGCCTGGG TAACCGCTTC CTCGCGAACA TCCGCGAGTC CGACGCGGTC
TGCCAGGTCG TCCGGGTGTT CTCCGACCCC GACGTGGTGC ACGTCGAGGG CAGGGTCGAC
CCGGCCGACG ACATCGAGAC GATCAACACC GAGCTGATCC TCGCCGATCT GCAGACCGTT
GACGCGCGCC TGCCGAAGCT GGAGAAGGAG GCCCGTGCCG ACAAGGCGAA GCAGCCGTTG
CTGGCCGCGG TGAAGGCCGC GCGCGAGGTG CTTGACGCCG GCCGCACGCT GTCCTCGGAA
CCGAAGATCG ATCGCGACGC CCTGCGGGAG CTTTTCCTGC TCACCGCCAA GCCCTTCCTC
TACGTCTTCA ACGTCGACGA GGACGTCCTC GCCGATCCCG GTCGGCGCAA GGAACTTGTC
GGCTCCGTCG CGCCCGCGGA CGCGATCGTG CTGTGCGCCA AGGTCGAGGC CGAACTGGCC
GAGCTCGACG AGGCCGACGC CGCGGAGCTG CTGGCCTCGC TCGGCCAGGA GGAGAGCGGC
CTTGCCCAGC TGGCCCGGAT CGGTTTCCAC ACCCTGGGGC TCCAGACGTT CCTGACGGCA
GGCCCGAAGG AGGCCCGGGC CTGGACCATC AGAGCCGGGG CGACCGCGCC GGAGGCCGCC
GGGGCCATCC ACACCGACTT CCAGCGCGGC TTCATCAAGG CCGAGATCGT CTCGTACGAC
GCCCTGATCG CGGCCGGTTC GATGGCCGCC GCCCGCGCGG CCGGCAAGGT GCGCATGGAG
GGCAAGGACT ACGTGATGGC CGACGGCGAC GTCGTGGAGT TCCGCTTCAA CGTCTGA
 
Protein sequence
MGLSIGIVGL PNVGKSTLFN ALTRNEVLAA NYPFATIEPN VGVVGVPDPR LGELAKLYDS 
ARTVPATVSF VDIAGLVRGA SEGQGLGNRF LANIRESDAV CQVVRVFSDP DVVHVEGRVD
PADDIETINT ELILADLQTV DARLPKLEKE ARADKAKQPL LAAVKAAREV LDAGRTLSSE
PKIDRDALRE LFLLTAKPFL YVFNVDEDVL ADPGRRKELV GSVAPADAIV LCAKVEAELA
ELDEADAAEL LASLGQEESG LAQLARIGFH TLGLQTFLTA GPKEARAWTI RAGATAPEAA
GAIHTDFQRG FIKAEIVSYD ALIAAGSMAA ARAAGKVRME GKDYVMADGD VVEFRFNV