Gene Francci3_3928 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3928 
Symbol 
ID3906887 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4700140 
End bp4701324 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content72% 
IMG OID637881255 
Productnucleoside triphosphate pyrophosphohydrolase 
Protein accessionYP_483007 
Protein GI86742607 
COG category[R] General function prediction only 
COG ID[COG3956] Protein containing tetrapyrrole methyltransferase domain and MazG-like (predicted pyrophosphatase) domain 
TIGRFAM ID[TIGR00444] MazG family protein 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.953323 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGATGC GGATCACCGT GGTCGTCACC AGCCCCCGGG TGGCACCGGG AATTCTCACG 
GCCCGGGCGT GGGATGTCGT GCGCACGGTG CCCGTCCTGA CCGCGAGCCC CACGCATCCT
CAGCTCGCCG CGCTGCGGGC GGCCGGTGCG ACCGTCCTGA TCGTCGATCC CGGCGATCCC
GGCGATCCCG GCGGTCTGGC CGGCGCGGTG ACCCTCATCC GGGCCGCCCT GCCTGATCCC
GCGGCGGCCG AGGTCGCCTG GCTACCCGAT CCGCTTGCGC CGAACATCCT CGACTGCCTC
CTCGGCGAGG CGAGTGGATC CGGGGAACGC AACGATGCCG GCGTCGACGG GGTGACCGGG
GTCGAAGTGA CGAGCCTCGT GGCCACCAGG GAGCTGCCCG GCTCGAGCCT GTTGGACGCC
GTCGCGGTCA TGGACCGCCT GCGGTCGCCG GGCGGCTGCC CCTGGGACGC CGAGCAGACG
CACACCTCGC TCGCACCGTA CCTGGTCGAG GAGACGTACG AGGCGTATCA GGCGATCGAG
GACGGCGATC TCACCGAGCT GCGCGAGGAA CTCGGCGACG TGCTGATGCA GGTGCTCTTC
CACGCTCGGA TCGCCGCCGA GCGCGCCGAC GACGGCTGGG ACGTCGATGA CATCGCGGCC
GGTCTGGTGG CCAAGTTGAT CCGCCGTCAT CCGCACGTGT TCGGTGACGT GGTCGTGGAC
GGTCCGGCGA ACGTCGTCGC CAACTGGGAC GCGATCAAGG CGGTCGAGAA GGGGCGCGTC
TCGGTGACCG AGGGTGTCCC ACTCTCCCAA CCGGCGTTGT CCCTCGCGGC CAAGCTGCTG
AAGCGGGCGG CCGGCATCGG CGTCCCGGCG GATCTCGCGC TGACCGGGAC GGCCGCGTGG
GGTGTGACCG ACGCGGGCGA TCGGGTCACC GAGATCGCCG CCACGGCCGC CGTACTGGCC
CGGTCGGGCA CGGCGGGGGA CGACCTGATC GGTGATCTGC TCTTTGCCGC CGTCGCGCTC
GCCCGGACGG CCTCCGTCGA TCCGGAGCGG GCCCTTCGGG CGACCGCCCG TCGCTTCCGG
GACCGGCTGG CCGCCGTCGA GGGCACAGTC CGTGCCGAGG GCGCCGATCC GGCTTCCCTG
TCCGACACCC GCTGGCGCGC AATCTGGCGG CAGCTACCCG GGTAA
 
Protein sequence
MLMRITVVVT SPRVAPGILT ARAWDVVRTV PVLTASPTHP QLAALRAAGA TVLIVDPGDP 
GDPGGLAGAV TLIRAALPDP AAAEVAWLPD PLAPNILDCL LGEASGSGER NDAGVDGVTG
VEVTSLVATR ELPGSSLLDA VAVMDRLRSP GGCPWDAEQT HTSLAPYLVE ETYEAYQAIE
DGDLTELREE LGDVLMQVLF HARIAAERAD DGWDVDDIAA GLVAKLIRRH PHVFGDVVVD
GPANVVANWD AIKAVEKGRV SVTEGVPLSQ PALSLAAKLL KRAAGIGVPA DLALTGTAAW
GVTDAGDRVT EIAATAAVLA RSGTAGDDLI GDLLFAAVAL ARTASVDPER ALRATARRFR
DRLAAVEGTV RAEGADPASL SDTRWRAIWR QLPG