Gene Franean1_1008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1008 
Symbol 
ID5669422 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1189908 
End bp1191920 
Gene Length2013 bp 
Protein Length670 aa 
Translation table11 
GC content74% 
IMG OID641239937 
Producttranscription termination factor Rho 
Protein accessionYP_001505370 
Protein GI158312862 
COG category[K] Transcription 
COG ID[COG1158] Transcription termination factor 
TIGRFAM ID[TIGR00767] transcription termination factor Rho 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.588232 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGTCGG CCTCGTCGCC GACCTCCCAG CCCGCGTCCG CGCCGGCCTC CGTGGCCCCG 
GCCGCGCAGG GCGCTCCGCC CGCGGCGGCT CCCGCCGCGG ACGCGGTGAG CTCCGCGCCC
CGGGCGGCGG CCGGCTCGGT CGCTCCCGTC CAGGAGACGA GCGGTTCCAG CGGCGCCGAG
GCGACCACGG GCGGCTCCGT TTCCGGCTCC GCGGCAGAGA CGTCGTCGGC CCCGACCCGT
GTGCGCGGGC GGCGGGGCGC CTCTCGTGGC GTCACGAGCC CCGCGGGCGA GCAGCAGACA
CTGCCGACCG GTCCGGGGGG TGCGGAGTCG TCCGACGACA CCGCCCGCCC GCAGGCCGCC
GCGGCTGCCG CGGACGGCCC GGCGGTGACC GCGCCCGCCG CGGCGCCCGT CTCCGTCCGC
GCCGGGACGA ACGGCTCCGA GAGCCCCACG CGCGGTCGTG ACGACCGCCG CGAGCGCTCC
GGTGACCGGG ACCGCTCCGG CGGCGACCGG GACCGTTCGG GTGACCGTGA CCGTTCCGGC
GGCGACCGGG ACCGTTCCGG CGACCGCCAG GGCCGCTCAC AGCCGGCCGG CGACCGCGAC
CGGAACGACC GTGCCGACCG CAGCGACCGT GCCGACCGTG CCGACCGTGC CGACCGTGCC
GACCGCAGCG ATCGTGCCGA CCGCAGCGAC CGTGCCGGTT CAGACCGGTC CAGGACGGTC
GAGCGCACCC AGCCGGGCGA CCGTGCACCC CAGGGCGGCG TCCAGGACGA CGACGAGTTC
GGCAGCCGGC GCCGCGGCCG GTTCCGGGAG CGCGGCCGCA ACCGCGGCCG CGGCGGGCAG
GGCGGCACGA CCGAGACCGA GCCGACGGTG CGCGAGGACG ACGTCCTCGT CCCGGTGGCC
GGCATCCTCG ACGTGCTGGA CAACTACGCC TTCGTCCGCA CGAGCGGCTA CCTGACCGGC
CCGACGGACG TGTACGTGAG CCTCGCCCAG GTCCGTCGCA ACGGCCTGCG CCGCGGCGAC
GCGATCACCG GGGTGGTGCG CGCGCCGCAG GAGGGCGAGC AGCGCCGCGA CAAGTACAAC
GCGCTGGTCC GGCTGGACAC GATCAACGGG ATGGAGCCGG AGGAGGCCCG CGGCCGGCCG
GAGTTCCACA AGCTCACCCC GCTCTACCCG CAGGACCGCC TGCGGCTGGA GACCGAGCCG
CACATGATGA CCACGCGGGT CATCGACCTG GTGATGCCGA TCGGCAAGGG CCAGCGCGCG
CTCATCGTGA GCCCGCCGAA GGCCGGCAAG ACGATGGTGC TCCAGTCGAT CGCCAACGCG
ATCACCACGA ACAACCCGGA ATGCCACCTC ATGGTCGTCC TCGTCGACGA GCGGCCCGAG
GAGGTCACCG ACATGCAGCG GTCGGTGAAG GGCGAGGTCG TCGCCTCGAC CTTCGACCGC
CCGCCGGCCG ACCACACCAA CGTCGCCGAG CTGTCCATCG AGCGGGCCAA GCGGCTCGTC
GAGCTCGGCC ACGACGTGGT CGTGCTGCTC GACTCGATCA CCCGGCTGGG TCGCGCCTAC
AACCTCGCGG CGCCGGCGTC GGGGCGCATC CTGTCCGGTG GTGTCGACTC GACGGCGCTC
TACCCGCCGA AGCGGTTCCT CGGCGCGGCG CGCAACATCG AGAACGGCGG CTCCCTGACG
ATCATCGCGA CCGCGCTGGT CGAGACCGGT TCGACGATGG ACACGGTGAT CTTCGAGGAG
TTCAAGGGCA CCGGTAACGC CGAGCTCAAG CTGGACCGGA AGATCGCCGA CAAGCGGGTC
TTCCCGGCGG TGGACGTCGA CGCCTCCGGC ACCCGCAAGG AGGACATCCT GCTGGCCCCC
GACGAGCTTG CGATCATGCA CAAGCTCCGC CGGGTCCTGC ACACCCGGGA GCCGCAGCAG
GCGCTCGACC TCCTGCTCGA CCGGCTGAAG CAGACCAGGA CGAACTACGA GTTCCTGATG
CAGATCGCGA AGACGGCACC GCCCCAGGAC TGA
 
Protein sequence
MPSASSPTSQ PASAPASVAP AAQGAPPAAA PAADAVSSAP RAAAGSVAPV QETSGSSGAE 
ATTGGSVSGS AAETSSAPTR VRGRRGASRG VTSPAGEQQT LPTGPGGAES SDDTARPQAA
AAAADGPAVT APAAAPVSVR AGTNGSESPT RGRDDRRERS GDRDRSGGDR DRSGDRDRSG
GDRDRSGDRQ GRSQPAGDRD RNDRADRSDR ADRADRADRA DRSDRADRSD RAGSDRSRTV
ERTQPGDRAP QGGVQDDDEF GSRRRGRFRE RGRNRGRGGQ GGTTETEPTV REDDVLVPVA
GILDVLDNYA FVRTSGYLTG PTDVYVSLAQ VRRNGLRRGD AITGVVRAPQ EGEQRRDKYN
ALVRLDTING MEPEEARGRP EFHKLTPLYP QDRLRLETEP HMMTTRVIDL VMPIGKGQRA
LIVSPPKAGK TMVLQSIANA ITTNNPECHL MVVLVDERPE EVTDMQRSVK GEVVASTFDR
PPADHTNVAE LSIERAKRLV ELGHDVVVLL DSITRLGRAY NLAAPASGRI LSGGVDSTAL
YPPKRFLGAA RNIENGGSLT IIATALVETG STMDTVIFEE FKGTGNAELK LDRKIADKRV
FPAVDVDASG TRKEDILLAP DELAIMHKLR RVLHTREPQQ ALDLLLDRLK QTRTNYEFLM
QIAKTAPPQD