Gene Franean1_0830 details


Gene Information

Locus tag: Franean1_0830
Symbol:
ID: 5669246
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 970323
End bp: 972332
Gene Length: 2010 bp
Protein Length: 669 aa
Translation table: 11
GC content: 75%
IMG OID: 641239759
Product: hypothetical protein
Protein accession: YP_001505194
Protein GI: 158312686
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1331] Highly conserved protein containing a thioredoxin domain
TIGRFAM ID:
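
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(970323, 972332)
print(length, protein_length(length))  # prints "2010 669"
```

The result matches the Gene Length (2010 bp) and Protein Length (669 aa) fields listed in the record.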


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.56502
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCCAACA AGCTCGCGGA GCAGACGTCC CCCTACCTGC TGCAGCACGC CGACAACCCG 
GTCGACTGGT GGCCCTGGGG GCCCGAGGCG TTCGCCGAGG CCACCACCCG CGGGGTGCCG
GTGTTGCTCT CCGTGGGCTA CGCGGCCTGC CACTGGTGCC ACGTCATGGC GCACGAGTCC
TTCGAGGACC CCGAGATCGC TGCGTACATG AACCAGCACT TCGTCAACAT CAAGGTCGAC
CGGGAGGAGC GCCCCGACGT CGACTCGGTG TACATGGACG TCACGGTCGC GCTCACCGGG
CACGGCGGCT GGCCGATGAC GGTGTTCCTC ACCCCGGCCG CCGAGCCGTT CTTCGCCGGC
ACCTACTTCC CACCCCGGCC GATGCGGGGC AGCGCGTCGT TCCCCCAGGT CATGGCGGCC
ATCGTGGACG CGTGGACGGC GCGCCGGGCG GAGGTCGAGC AGTCCGGGGC GGACATCGCC
CGCCAGCTGG CGGAGGCGGT GGCGCCCGGT GGAGCGGCGT CCGGCGGTGG GGCCACAACA
CAGATCACGG CCGACCTTCT CGACCGGGCG GTCGCCGGGC TGGCCGACCG GTTCGACTCC
GTGCACGGCG GGTTCGGCGG GGCGCCGAAG TTCCCACCGT CGATGGTCGC CGAGATGCTG
CTGCGGAGCT GGGCCCGCAC CGGGGACGGC CGGGCCCTGG GAATGGTGCG CGAGACCTGC
GAGCGGATGG CGCGCGGCGG GATGTACGAC CAGCTCGGCG GCGGCTTCGC CCGCTACAGC
GTGGACGAGT CGTGGACCGT CCCGCACTTC GAGAAGATGC TGTACGACAA CGCCCAGCTG
CTGCGGGTCT ACCTGCATCT GTGGCGGGCC ACGGGCCTGC CGCTGGCCGA GCGGGTGGTG
CGCGAGACCG CGGCCTTCCT GCTCGCCGAC CTGCGTACCC CCGAGGGGGG CTTCGCCTCG
GCGCTCGACG CCGACGCCGT GCCGGCGGGC AGCCCCGGCG GCCATCCCGA GGAGGGCGCG
AGCTACTCGT GGACGCCCGC CCAGCTGGTG GACGTGCTCG GCCCCGACGA CGGTGCCCTG
GCCGCGCGGG TCCTGGGGGT CACCGCGGAG GGGTCGTTCG AGCACGGGAC GTCCGTGCTG
ATGCTGCCGG CCGACCCGGA GGACCCCGCC AGGTTCGCCC GGGTCCGGGC GGCGCTGGCC
GCCGCGCGCG CCACCCGCCC GCAGCCGGCC CGGGACGACA AGATCGTCGC GGCCTGGAAC
GGTCTGGTCA TCGGGGCGCT CGCCGAGGCG GGCGCGCTGC TGGGCGAGCC GTCCTGGGTG
GGGGCCGCCG AGCGCGCCGC CGAGCTGCTG CGGGACGTCC ACCTGCACGA GGGCCGGCTG
TGGCGGACCA GCCGGGACGG CCGGCGCGGC CCCAACGCCG GTGTGCTGGA GGACTACGGG
TGTGTCGCCG AGGGCTTCCT GACGCTGCAC CAGGTGACAG GCGCCGCGGG CTGGCTCGCG
CTCGCCGGCG AGTTGCTCGA TGTGGTCCGG GCGCGGTTCG CGGCGCCGGA CGGCGGTTAC
TTCGACACCG CGGACGACGC CGAGGCGCTG CTGCGCCGGC CGCGGGACGC CTCCGACTCG
GCGACCCCCT CGGGCCAGGC GGCCGTCGCC GGCGCCCTGC TGACCTACGC GGCGCTCACC
GGCTCCGCCG ATCACCGGGA CAGCGCGCGG GCGACCGTCG AGCAGCTCAC CCCGCTGTTG
AGCCGGGACG CCCGTTTCGC CGGCTGGGCG GGTGCCGTCG CGGAGGCCCT GCTGGCCGGG
CCGGCCGAGG TCGCGGTGGT CGGCCGGCCG GATCTGGAGC GTCTGGCCAG GCTCGGCACC
GCTCCCGGCG CGGTTGTCGT CACCGAGGGC CCGCTGACCG CGGGCCGGGA CGAGCCGGCC
GTCTACATCT GCCGGGACTT CGTCTGCGAG CTCCCGGCGC GGACCCCGGA GGAGGTCCGC
GCGCGGCTGG GAGTGCGGCT TCCAGCCTGA
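
The GC content reported in the Gene Information section can, in principle, be recomputed from the gene sequence above. The following is a minimal sketch, assuming GC content means the percentage of G and C bases among all A, C, G, and T bases, with the spaces and line breaks of the listing ignored; `gc_percent` is a hypothetical helper name.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence."""
    # Keep only nucleotide letters, so block spacing and newlines are ignored.
    bases = [base for base in sequence.upper() if base in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G, or T bases")
    gc_count = sum(base in "GC" for base in bases)
    return 100.0 * gc_count / len(bases)


# Usage: pass the full gene sequence as one string; shown here with only
# the first two 10-base blocks of the listing.
print(gc_percent("GTGCCCAACA AGCTCGCGGA"))
```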
 
Protein sequence
MPNKLAEQTS PYLLQHADNP VDWWPWGPEA FAEATTRGVP VLLSVGYAAC HWCHVMAHES 
FEDPEIAAYM NQHFVNIKVD REERPDVDSV YMDVTVALTG HGGWPMTVFL TPAAEPFFAG
TYFPPRPMRG SASFPQVMAA IVDAWTARRA EVEQSGADIA RQLAEAVAPG GAASGGGATT
QITADLLDRA VAGLADRFDS VHGGFGGAPK FPPSMVAEML LRSWARTGDG RALGMVRETC
ERMARGGMYD QLGGGFARYS VDESWTVPHF EKMLYDNAQL LRVYLHLWRA TGLPLAERVV
RETAAFLLAD LRTPEGGFAS ALDADAVPAG SPGGHPEEGA SYSWTPAQLV DVLGPDDGAL
AARVLGVTAE GSFEHGTSVL MLPADPEDPA RFARVRAALA AARATRPQPA RDDKIVAAWN
GLVIGALAEA GALLGEPSWV GAAERAAELL RDVHLHEGRL WRTSRDGRRG PNAGVLEDYG
CVAEGFLTLH QVTGAAGWLA LAGELLDVVR ARFAAPDGGY FDTADDAEAL LRRPRDASDS
ATPSGQAAVA GALLTYAALT GSADHRDSAR ATVEQLTPLL SRDARFAGWA GAVAEALLAG
PAEVAVVGRP DLERLARLGT APGAVVVTEG PLTAGRDEPA VYICRDFVCE LPARTPEEVR
ARLGVRLPA
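
The protein sequence starts with M although the gene sequence starts with GTG, which normally encodes valine. Under translation table 11 (the table listed in the record), GTG is one of the permitted initiation codons, and an initiation codon is read as methionine. The following is a minimal sketch of that rule, checked against the first 60 bases of the gene sequence; `translate` and the constant names are hypothetical, and the code assumes the sequence is given in frame starting at the initiation codon.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons allowed by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(sequence: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    dna = "".join(sequence.upper().split())  # drop spaces and newlines
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residues.append("M")  # initiation codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


first_line = "GTGCCCAACA AGCTCGCGGA GCAGACGTCC CCCTACCTGC TGCAGCACGC CGACAACCCG"
print(translate(first_line))  # prints "MPNKLAEQTSPYLLQHADNP"
```

The output matches the first 20 residues of the protein sequence listed above (MPNKLAEQTS PYLLQHADNP).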