Gene Franean1_3597 details

Gene Information

Locus tag: Franean1_3597
Symbol:
ID: 5671966
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4259439
End bp: 4260635
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 68%
IMG OID: 641242483
Product: extracellular ligand-binding receptor
Protein accession: YP_001507903
Protein GI: 158315395
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
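
The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the numbers listed) and that the CDS is unspliced and ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS: codons minus the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(4259439, 4260635)
print(length)                     # 1197
print(protein_length_aa(length))  # 398
```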


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGCGCG GCTCGATCCG CCTCCTCGTC CCCCTGCTGG CCGCTTTATC GGTCGCCATG 
ACGGCATGTG GCGGCTCCGA TGACGGAGCG AGCAGTGACA GCGGAACCAT CAAGATCGGC
GCCTGGATCC CACTGACTGG CGCGCAGGCA TCCTCCGGCG TCCCTCAGGC GGAGGGCGCG
AAGGCGTACT TCGCATGGCT CAACGACAAC GGCGGTGTGA ACGGCCACCA GATCGAGTGG
ATCGTCAAGG ACAACGCCTA CGACCCGCAG CAGACCGTCC AAGCGGCCCG CGAGCTTGTC
GCCCAGGACC ACGTGGTCGC CATCGTGAAC GCCAACGGGA CCGCGCCGTC CGAGGCGGCG
TTCCCCTATG TCCTCAACCA GTCGAAGGTC CCGATCGTCG ACCACTACGG CGGGTCCGCC
GCCTGGTACG ACCCGCCCCG GCCGCTGCTG TTCGGCACCC AGACCCTCTA CGAGGACCAG
GCCGCGGCCA TGGCCACCTG GGCGGTCGAG TCCGGAGCGC GCAAGATCAT GGTCGTGCAC
GACGATCCGC AGGCATTCGC TAACGTCGCA AAGCAGATCG AACCCGCCGC CAGGCAAGCC
GACCCGAGCG TGTCGACCAC GATGCTTTCA GTAAAGCTCG GTACCACCGA CTTCGCTCCG
GCGGTTAGCC AGGTGCGCAA CGAGGCACCC GACGCCGTCA TGCTCATCAT GCCCACGCAG
GAGACAGCCG CTTACCTCAA GGAGGCGAAG CTGCAGGGCG TGCAGGTGCA GGCGTACGGA
TACTCGCCGA CGGCGTCCGC GACCACGGTG ACGCTGGCCG GAGCCGCCGC CGAGGGCTTC
CGTGCCGTGT CGGTGGTCGG CGTGCCGTCC CACACGAGCC CGCAGATGCA GCAGTTCCGC
GAGGTCATGG CGAAGTATGC GCCCGACCAG CCAGCGGACT TCTCGACTCT GCTCGGCTAC
GTTAACGCGG CCGTGTTCGC CGAGGTCGCC AAGACGATCG ACGGCCCGAT CACCTCGGAG
TCCATCGCCA ATGCGTACGA GAACGCGCAG GGCATCTCGA CCGGCGTCGC ACCCGACATG
AGCTACTCGG CCGACCAGCA TCTCGGCACC CGCCAGGTTC AGCGGACGTA TGTCAAGGAC
GGGCAGTGGG TGGCCGAGGG CGGGTTCTTC ACCCCGCCGG AGCGAGCAGC GGCCTGA
 
Protein sequence
MRRGSIRLLV PLLAALSVAM TACGGSDDGA SSDSGTIKIG AWIPLTGAQA SSGVPQAEGA 
KAYFAWLNDN GGVNGHQIEW IVKDNAYDPQ QTVQAARELV AQDHVVAIVN ANGTAPSEAA
FPYVLNQSKV PIVDHYGGSA AWYDPPRPLL FGTQTLYEDQ AAAMATWAVE SGARKIMVVH
DDPQAFANVA KQIEPAARQA DPSVSTTMLS VKLGTTDFAP AVSQVRNEAP DAVMLIMPTQ
ETAAYLKEAK LQGVQVQAYG YSPTASATTV TLAGAAAEGF RAVSVVGVPS HTSPQMQQFR
EVMAKYAPDQ PADFSTLLGY VNAAVFAEVA KTIDGPITSE SIANAYENAQ GISTGVAPDM
SYSADQHLGT RQVQRTYVKD GQWVAEGGFF TPPERAAA
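
The gene and protein sequences can be related with a minimal translation sketch. It builds the codon table from the amino-acid assignments that translation table 11 shares with the standard code, and it ignores the alternative start codons of table 11, which do not matter here because this gene starts with ATG. The helper names `clean`, `gc_content` and `translate` are hypothetical, and only the first 30 bases and the last 9 bases of the gene sequence are used as input.

```python
from itertools import product

# Codon table in the conventional T, C, A, G order. Translation table 11
# uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display sequence blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as displayed in the record.
print(translate("ATGCGGCGCG GCTCGATCCG CCTCCTCGTC"))  # MRRGSIRLLV
# Last 9 bases, ending in the TGA stop codon.
print(translate("GCGGCCTGA"))  # AA
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison with the protein sequence and the GC content given in the record.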