Gene Franean1_0156 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0156 
Symbol 
ID5668581 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp186804 
End bp187838 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content70% 
IMG OID641239085 
Productsubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_001504529 
Protein GI158312021 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0633468 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGTCGG CGAACGTGAG CGAACCAACG TCGGTGGCGG CAGAGGCTGG GCAGGCCGTG 
CACCGGCGCA CCCGGCGCAC CCGACGGATC CGCCCGTGCC CGCGCCGTGC CGGCCGGCGC
CGGGTGGGCG CCGCGGCGGC GGCCATGCTA GCTCTCGCGC TGACCACTGC GGCCTGCGGC
GACGGCTCCG GCGGCGCCAG CCTTGGTTCG CTCACCGTCG CCGACGCCGG TTTCACCGAG
AGCAAGATCC TCGCCGACAT GTACGGCGAA CTGCTCACCA ACGCCGGCTA CAAGGTCGAG
CGGACCTCGG TGCAGAGCAC CGAGATCGCC CAGTCCTCCC TGGAGAGCGG GCAGATCGAC
GCCATGCCGC AGTATGTGGC GACCTACGCC GACCTGCTGA ACTCCCAGGT CAACGGCTCC
GGAGCCACCT CGGTCTCGTC GTCGGACCTG AACGCGTCGC TGGCCGGACT GCGCCGCCTC
GCGAAGGGGC TGGGCCTGAG CGTGCTCGAG CCGGCCGACG CGGTCGACCA GAATGCCTTC
GCGGTGAGCA GGTCCTTTGC CGCGGAGCAC CACCTGACGA CCCTGACCGA CCTGGGTGCC
AGTGGCCTGA CGGTGAAGCT CGCTGCGGGC GCCGAGTGCG CGACCCGCCC GTTCTGCCAG
CCCGGGTTGG AGAAGACCTA CGGGATCAAG ATCTCCGAGA TCGTCGAAAC CGGTGTGGCG
ACCGCCCAGA CCAAGGCCGC GGTACGGGAC AACACCGCGC AGCTCGGCCT GGTGCTCACC
ACCGACGCCA CCGTCAACGG CTACAACCTG GTCGTGCTGA CCGACGACAA GAAGCTCCAG
AACGCCGACA ATCTGGTGCC GATCGTGAAC ACCGACTCGC TCGCCCCGGA GATCACCTCG
GCACTGAACG CGCTGGCGCC GGTGCTCACG ACGGCGGACC TGGCCGAGCT GAACAAGAGG
GTGGACGCGG AGCGGGAGAA GTCCGAGGAG GTCGCGCACG ACTTCCTGGC CGAGAAGGGC
CTGCTGGACA GCTGA
 
Protein sequence
MGSANVSEPT SVAAEAGQAV HRRTRRTRRI RPCPRRAGRR RVGAAAAAML ALALTTAACG 
DGSGGASLGS LTVADAGFTE SKILADMYGE LLTNAGYKVE RTSVQSTEIA QSSLESGQID
AMPQYVATYA DLLNSQVNGS GATSVSSSDL NASLAGLRRL AKGLGLSVLE PADAVDQNAF
AVSRSFAAEH HLTTLTDLGA SGLTVKLAAG AECATRPFCQ PGLEKTYGIK ISEIVETGVA
TAQTKAAVRD NTAQLGLVLT TDATVNGYNL VVLTDDKKLQ NADNLVPIVN TDSLAPEITS
ALNALAPVLT TADLAELNKR VDAEREKSEE VAHDFLAEKG LLDS