Gene Franean1_0461 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0461 
Symbol 
ID5668882 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp544538 
End bp546100 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content65% 
IMG OID641239392 
Productextracellular solute-binding protein 
Protein accessionYP_001504830 
Protein GI158312322 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCAAGA GACGTTTCCT CACCCCCGCC GTCGCCGGCC TGGTGACCCT GGTCTTGGCC 
GCCTGTGGTG GCGGCTCCGG CTCCTCCCCG TCCACCCCCG CCACCGGTGA ACCGATACCG
GGAGGCAAGG CAACGGTGCT CATGTTAAGC GACCCGACCA CTCTGGACCC GGCCCGTCTC
GGCAATGCCT ATGCGATCAC TCCTGTCCTG GGGAATGCCC TGTACGGGAC GTTACTGACC
GACGACAAGA AGACCGGCGA GATCCAGTAC TCGCTGATCG AGTCATTCGA GACAACCGAC
CAGGGCGCCA CATTCACCCT CAAGCTGCGT CCGGACCTGG TGTTCTCTGA TAGCACCCCC
TTCGACGCCG AGGCCGTCAA GTTCAACTGG GACAGGATGA GAGACCCTGC CACCGGTTCG
ACCTCCATCG CGGAGGCATC GATGATTAAG GCCATCAAGG TGGTGGACGA CGTCGTTCTT
GAGGTCACCA TGGCCACCCC GGTACCCAGC TACGCCTATT CGATCTTGAC CAGCTCCATG
AACTGGGTCG CCTCACCCAC GGCCCTACGC AAGGGGGCGG AGGCCTTCAA CGAGAACCCG
ATCGGCGCCG GACCGTTCAC CCTGCAGCGT TGGAACAGGC AGGCCACCAT CGAACTGATC
AAGAATCCCC GCTACTGGGA CGCCCCCAAG CCCTACCTCG ACGGGCTCAC GCTGCGCGCG
GCCACCGACT CCGGCCAACG GCTCAACACC GTGGTCTCCG GTGGCGCGGA CGTGGCCGTC
GACTCGAACT GGCTCAACAT CGCCAAGGCC CGGGAGCAGG GCCTGACCGT CGACCTGCAG
GAACTCAACG GCGGCATCCT CATCGCCCTG AACATGCGCC GAGCCCCCTT CGACGACATC
CGCGCTCGCC GCGCCGTCTC CGCCGCCCTC GACCTCGATG CACTCAACCT CGCCGTCTAC
AGCGGCGAGG GGAAGATGGT CGACACCCTG TTCACCAAGG GCTCGTTATT CTATTCCGAC
ACTCCCCTGC GCAAGCACGA CAAGGAAACC GCGCAACGGC TCTTCGACGA GCTGGCCGCG
GACGGTAAGC CGGTGTCATT CACCTTTTCC GCCTACCCCA CCACCGAGAA CCGGACGACG
GCCGAGAACA TCCAGGCCCA GCTCGGCGCC TTCCGGAACG TCAAGGTCGA AATCGCGACC
GTCGATTTCT CCCAGCTCGC CAAAGTGCGG TCGCAGCACG ACTTCGACAT GATCGTCTCC
GGCGGGTTCT TCCGTGATCC CGAGCCCGGG CTGTGGACGG CGTTCCACAG CAGCTCGGTG
GCCAACCAGA CCGGCGTCGA CGATCCGACG CTCAATGAGG CGCTGCTGGC CGGACGGACG
GAGATCACCC AGGAGGCCCG CGAGAAGGCC TACGCCACCG TCCAGCAGCA GCTAACCGAT
CTGGTCCCGG TGATCTACCT CGCGCGGGTG GCACCCAGCG CGATTGCGAA CACGAACGTC
GGCGGTGTCA TCCAATACGG CAACGGTTCC CTGCGACCAG AGGAGCTGTG GATCAAGAAG
TAG
 
Protein sequence
MRKRRFLTPA VAGLVTLVLA ACGGGSGSSP STPATGEPIP GGKATVLMLS DPTTLDPARL 
GNAYAITPVL GNALYGTLLT DDKKTGEIQY SLIESFETTD QGATFTLKLR PDLVFSDSTP
FDAEAVKFNW DRMRDPATGS TSIAEASMIK AIKVVDDVVL EVTMATPVPS YAYSILTSSM
NWVASPTALR KGAEAFNENP IGAGPFTLQR WNRQATIELI KNPRYWDAPK PYLDGLTLRA
ATDSGQRLNT VVSGGADVAV DSNWLNIAKA REQGLTVDLQ ELNGGILIAL NMRRAPFDDI
RARRAVSAAL DLDALNLAVY SGEGKMVDTL FTKGSLFYSD TPLRKHDKET AQRLFDELAA
DGKPVSFTFS AYPTTENRTT AENIQAQLGA FRNVKVEIAT VDFSQLAKVR SQHDFDMIVS
GGFFRDPEPG LWTAFHSSSV ANQTGVDDPT LNEALLAGRT EITQEAREKA YATVQQQLTD
LVPVIYLARV APSAIANTNV GGVIQYGNGS LRPEELWIKK