Gene Franean1_3159 details


Gene Information

Locus tag: Franean1_3159
Symbol:
ID: 5671536
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3721799
End bp: 3724198
Gene Length: 2400 bp
Protein Length: 799 aa
Translation table: 11
GC content: 66%
IMG OID: 641242054
Product: glycoside hydrolase family protein
Protein accession: YP_001507474
Protein GI: 158314966
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
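
The length fields above can be cross-checked against the coordinates. The sketch below is illustrative and not part of the record. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted as a residue. The function names gene_length and protein_length are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Coordinates taken from the Start bp and End bp fields of this record.
length_bp = gene_length(3721799, 3724198)
print(length_bp, protein_length(length_bp))
```

Under those assumptions, the computed values agree with the Gene Length (2400 bp) and Protein Length (799 aa) fields.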


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTGGGATG AGGTCAGGCT CCCTCATGAC GCGATGGTGG CTCGTGAACG TAACCGTTCC 
GACGCCGCGG CGGGCCAGCG CGGCTTCTAC CCGGCTGGGG CGTACCAGTA CAGGAAGTCG
TTCTTCGTGC CGGAGGAGTA CCGGAACCGG CGCGTCACCT TCGAGTTCGA GGGTGTCTAC
CGGAGCGCCA GGGTTTTTCT CAATGGTGGC CTCGCCGGAC AGCATGCCTA CGGCTACTCC
CACTTCTACG TCCGCGCCGA CCATTTCCTG AAGTACAACG AGGACAACGA GATCCTGGTG
GAGGCACACA GCGCCGATGA CACCCGCTGG TACTCGGGTG GTGGCCTGTA CCGCAACACG
AAGCTCATCG TCGGCGATCT CGTGCACATC GGCCTGGACG GGGTGAAGGT CACGACGCCG
GCCATCGACG CGGACCTTGC CCTAGTCGCG GTGGCGACGC AGGTGCACAA CGAGTCCTCG
GTCACCCGGA CCGTCGAGGT GACCACCGAG ATCGTGGGCG CCGACGGTGT CGTCGTGGTC
CGTGACGTCG CCCCGCTCAC CCTGTTCACG GACGATCCGG TCACCGTACG CCAGCGGCTG
CCGGTGCCGC GGCCGCAGCT GTGGGGAGTG GAGCACCCGC ACCTCTACAC CTGCCGGACC
AGAGTGACGG CGGACGGAGA GCTCCTGGAT GAGGAGGCCA CCCGTTTCGG TGTCCGGTCG
CTCACCGTCG ATCCGCACTG GGGTCTGCGC ATCAATGGTG GGGTGGTGAA CCTCCGCGGC
GCGTGCATCC ACCACGACAA CGGCGTGATC GGTGCCGCCA CGATAGACCG GGCCGAACAG
CGCCGGGTCG AGATCCTGAA ACAGGCCGGT TTCAACGCCA TCCGTAGTTC CCACAATCCG
ATCAGCAAGG CCCTCCTCGA CGCCTGCGAC CGGCTCGGCA TGCTCGTAAT CGACGAGTTG
TTCGACGCAT GGACCCGGTC GAAGGTCGCG CAGGACTACG CCCTCGACTT CCCCCTCTGG
TGGAAGTCGG ACGTGCAGGC GATGGTCGAC AAGGACTTCA ACCACCCCTG CGTGATCCTC
TACTCGATCG GGAACGAGAT CCCGGAGACA GGCACCGCTG CCGGCGCGGC GATCAGCCGC
CAGCTCGCCG AGAGGATCCG AGCCATCGAC GACACCCGTT TCGTCACGAA CGGCGTCAAC
GGCCTTCTCG CCGGCGGCCC CGAGCTGCTC GCGTCGTTCT CCAGCGGTGC TCGGGAGAAA
AGCAGCGAGG CGGGCGAGGC GGTGGACGTC AACGCGTTCA TGAACAGGTT CCGCGAGTTT
ATGCCGATCC TCATGGCCTC CGAAATGGTC GGTTCGAAGA CCGCGGAGTC GATGGCCTGC
CTGGACGTCG CCGGCTACAA CTACCTGGAG TCACGGTACG AGCTGGACCG AACACTGTTC
CCGAACCGGG TGATCGTGGG GACCGAGACC TACCCGTCGG AGATCGACAG GAACTGGCGG
CTCGTCCAGG ACAACAGCCA CGTCATCGGT GACTTCACCT GGACCGGCTG GGACTATCTC
GGCGAACCAG GAATCGGGCG GATCGAGTAC CAAGGCGACG AGGAAAACGC CAGCACCTCC
CCATCCCACG GCAGTTATCC GTGGCTGACC GCGTGGTGCG GCGACATCGA CATCACCGGC
CACCGCCGAC CGGCCTCCTA CTACCGCGAG ATCGTGTTCG GCCTACGCAG CGAGCCCTAC
ATAGCCGTGC ACCGCCCCGA CCGCTACGGC CAGCCAGTTA CGGTGGCGAT GTGGTGGTCG
TGGAGTGACG CGATCTCCAG CTGGTCCTGG GACGGCCACG AGACCAGACC GGTGCGGGTG
GAGGTTTACT CGGCCGCCGA CGAGGTCGAA CTTTTGGTCA ACGGCCGGCT GATCGGTACC
GTCCCGGCGG GGGAGAAGAA CCGGTTTAAG GCCGAGTTCG ACACCGTTTA CGAACCCGGC
GAAATCGTCG CTGTCGCCTA CACCGCTGGC CGCGAGACCG GACGCACCCT GCTGCGCTCG
GCGACCGGCG AGGTCCGCCT CGCCGTCGCC GCCGACCGCA CCGACATCGT CGCCGACGAC
ACCGACCTTG CCTACATCGC CATCACCCTC GTCGACGAGG CCGGCAACCT CTACAACACC
GCCGACCGCA CGGTCGCCGT CGAGGTGGCG GGACCCGGCG TGCTGCAGGG CTTCGGTAGC
GCGGACCCGA AAACGGAAGA GAACTTCTTC GATACCACCC GCGCCACCTT CGACGGCCGG
GCACTCGCCG TCATCCGCCC CACCGCCCCC GGCACGATCA CCGTGACCCT CACCGCGCAG
GGATGCGAGC CCTCCACTAT CCGCATCGAA GCCGAACTTA CGGCACGCTC AGGCGAATGA
 
Protein sequence
MWDEVRLPHD AMVARERNRS DAAAGQRGFY PAGAYQYRKS FFVPEEYRNR RVTFEFEGVY 
RSARVFLNGG LAGQHAYGYS HFYVRADHFL KYNEDNEILV EAHSADDTRW YSGGGLYRNT
KLIVGDLVHI GLDGVKVTTP AIDADLALVA VATQVHNESS VTRTVEVTTE IVGADGVVVV
RDVAPLTLFT DDPVTVRQRL PVPRPQLWGV EHPHLYTCRT RVTADGELLD EEATRFGVRS
LTVDPHWGLR INGGVVNLRG ACIHHDNGVI GAATIDRAEQ RRVEILKQAG FNAIRSSHNP
ISKALLDACD RLGMLVIDEL FDAWTRSKVA QDYALDFPLW WKSDVQAMVD KDFNHPCVIL
YSIGNEIPET GTAAGAAISR QLAERIRAID DTRFVTNGVN GLLAGGPELL ASFSSGAREK
SSEAGEAVDV NAFMNRFREF MPILMASEMV GSKTAESMAC LDVAGYNYLE SRYELDRTLF
PNRVIVGTET YPSEIDRNWR LVQDNSHVIG DFTWTGWDYL GEPGIGRIEY QGDEENASTS
PSHGSYPWLT AWCGDIDITG HRRPASYYRE IVFGLRSEPY IAVHRPDRYG QPVTVAMWWS
WSDAISSWSW DGHETRPVRV EVYSAADEVE LLVNGRLIGT VPAGEKNRFK AEFDTVYEPG
EIVAVAYTAG RETGRTLLRS ATGEVRLAVA ADRTDIVADD TDLAYIAITL VDEAGNLYNT
ADRTVAVEVA GPGVLQGFGS ADPKTEENFF DTTRATFDGR ALAVIRPTAP GTITVTLTAQ
GCEPSTIRIE AELTARSGE