Gene Franean1_1204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1204 
Symbol 
ID5669617 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1439371 
End bp1440387 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content77% 
IMG OID641240136 
Producthypothetical protein 
Protein accessionYP_001505564 
Protein GI158313056 
COG category[S] Function unknown 
COG ID[COG1426] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00102705 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.560918 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGTCAG CGGACGAACC GGCGACCCCC GAGTCGGCGA GTCCCGGGTC GGCGGGCACC 
GCGCCGGCGA GCCCCGGGCC GGAGGTTCTC GAACCGGTGA CCGCGAAGCC GGCGGCCCCG
GGCCCCGCGG CCCAGATCCC GGCGCCCCAG CAACCGGTGC CTCCGGACAA GTCGGCTCCG
GGCGGGGCGG CTCAGGAGTC CCTGGGCTCG GTCATCGCCG CGGCGCGCCG GGCCGCGGGC
CTGACGATCG ACGACGTGAG CGACCGGACG AGAATCCGCG CCTCGCTGAT CGAGCGGATC
GAGCAGGACG ACTTCTCCGG CTGCGGCGGC TCCGTCTACG CTCGCGGGCA CCTGCGCAGC
ATCGCGACGA CGCTCGGGCT CGAGCCCGGC CCCCTGCTGG CGGTGTACGA CGCCGGGCAC
GAGCACGTGC CGTCCCCGGT CGTGGTCGCC TCGCCGGAGT TCGACCCGCT GCACGGTGGC
GCGGGGCGTA ACCGCGGCCT TGGCGGCTTC CGCTGGGCAC CGGCAATGAT CATCTCGCTG
GTGGTGGTGT GCGCGCTGGC GCTCGTCGCG CTGCTGCTGC CCTCGGGCGG GGGCGACTCG
GACGACTCGG CCACGCCCCG GCCGAGCGCG CCGCCGAGCG CCCCGGCCGC CACGGCACCC
CCCGGCCCGG CGCCGGCGCC CACCACCCCG CCGCCCCCCG GGGTGAACGT GCGGGTGGAG
GCCCGCGACG CGCAGAGCTG GCTGGAGGTC CGCGACGACA GCGACAAGGT GCTGTTCGCG
CAGCTCCTGC AGCGGGGCGA CAGCCGTGAG GTGTCCTCCG AGGGCGCCCT GGAGATCAAG
ATGGGCAACG CGGGCGCCGT CGACCTCTCC TGCAACGGCA CGAGCCTGGA CCGGGCCGGC
GGGCCGGGCG AGGTCGTGAC GATCCGGCTA GCGCTCGCCG CGACCGGCGG TGGCTGCACG
GTCGGCGGGC CGGGCACGGG CGGCCTCGCG GCCGGCGGGC TGGCGGGCTG GCGATGA
 
Protein sequence
MQSADEPATP ESASPGSAGT APASPGPEVL EPVTAKPAAP GPAAQIPAPQ QPVPPDKSAP 
GGAAQESLGS VIAAARRAAG LTIDDVSDRT RIRASLIERI EQDDFSGCGG SVYARGHLRS
IATTLGLEPG PLLAVYDAGH EHVPSPVVVA SPEFDPLHGG AGRNRGLGGF RWAPAMIISL
VVVCALALVA LLLPSGGGDS DDSATPRPSA PPSAPAATAP PGPAPAPTTP PPPGVNVRVE
ARDAQSWLEV RDDSDKVLFA QLLQRGDSRE VSSEGALEIK MGNAGAVDLS CNGTSLDRAG
GPGEVVTIRL ALAATGGGCT VGGPGTGGLA AGGLAGWR