Gene Arth_1154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1154 
Symbol 
ID4446354 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1251944 
End bp1253293 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content63% 
IMG OID639688961 
Productextracellular ligand-binding receptor 
Protein accessionYP_830648 
Protein GI116669715 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.443442 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTTCAC TCCCCCAGGC GGCGCCGCGA GTCGCTAAGC TCACAGCGCT TAGCATCGGC 
GTCGCCCTTC TGGCTACGGC TTGTGGCGGC TCGTCCACCC CGAGTTCGAC AGGCTCCACC
ACTCCGGCGG CATCCGGAAT CGCCTGCCCG GCGCCGAGCG CTAGCGGCGG CGCCACCACC
GCAGCGGGCG CGGGCGGCTC GGTCCCGGCC TCTACCACTA CTACGGATAC TCCGCTCAAG
ATCGGCTCGC TTCTGCCTAC CACGGGCTCG CTGGCGTTCC TCGGGCCGCC CGAAATTGCC
GGTGTGAACC TGGGCATCAA GGAAGTCAAT GACGCAGGCG GCGTCCTGGG CAAGCCCGTC
GAAGTGATCC ACCGCGACTC CGGTGACACC AAGACCGACA TTGCAACGCA GTCCACCACA
GCGCTGCTGG GCAGCGGCGT CAGCGCCATT ATCGGCGCTG CATCATCGGG GGTTTCCAAG
ACCGTCATCA ACCAGATCAC CGGTGCCGGT GTCATCCAGT TCTCGCCCGC GAACACGTCT
CCCGACTTCA CCACCTGGGA TGACAAGGGC CTCTACTGGC GCACGGCTCC CTCCGATGTG
CTGCAGGGCA AGGTGCTCGG CAACTACATG GCTACCTGTG GCGCACAGAC CGTCGGCATG
ATCGTTCTGA ACGATGCGTA CGGCACCGGC CTGGCCAAGA ACGTCAAGTC TGCGTTTGAA
GCTGCCGGCG GCAAGGTTGT TGCCGAGGAG CTCTTCAACG AGGGCGACTC GCAGTTCAGC
AGCCAGGTGG ACAAGGTCAT TGCAGCCAAG CCGGATGCGA TTGCCCTGAT CACCTTCGAC
CAGGCTAAGA GCATCGTGCC CCTGATGACC GGCAAGGGCA TCAAGGCGAC CCAGATGTTC
CTGGTTGACG GCAACACCTC GGACTACAGC AAGGACTTCC AGGCGGGAAC GCTGAAGGGC
GCCCAGGGCA CCATCCCGGG CACGTTCGCC AAGGACGACT TCAAGAAGAA GCTGCTGGCA
ATCGACCCGG CGCTGAAGGA CTACAGCTAT GCAGGCGAGT CGTACGACGC CGTCAACCTG
ATCGCGCTGG CTGCGGAAGC CGCTAAGAGC ACCAAGGGTA CCGACATCGC CAAGCAGCTC
AAGGCAGTCT CCGAAAGCGG CGAGAAGTGC AACGACTTCC CGTCCTGCGT CACGCTGCTC
CGCAACGGCA AGGACATCGA CTACGACGGC CAGTCCGGTC CGGTGACCTT CTCCGACGCC
GGTGACCCGA CGGAAGCCTA CATCGGCATC TACGAGTACC AGGATGACAA CACCTACAAG
CCGTCGAAGG AAGAATTCGG CAAGCTGTAA
 
Protein sequence
MISLPQAAPR VAKLTALSIG VALLATACGG SSTPSSTGST TPAASGIACP APSASGGATT 
AAGAGGSVPA STTTTDTPLK IGSLLPTTGS LAFLGPPEIA GVNLGIKEVN DAGGVLGKPV
EVIHRDSGDT KTDIATQSTT ALLGSGVSAI IGAASSGVSK TVINQITGAG VIQFSPANTS
PDFTTWDDKG LYWRTAPSDV LQGKVLGNYM ATCGAQTVGM IVLNDAYGTG LAKNVKSAFE
AAGGKVVAEE LFNEGDSQFS SQVDKVIAAK PDAIALITFD QAKSIVPLMT GKGIKATQMF
LVDGNTSDYS KDFQAGTLKG AQGTIPGTFA KDDFKKKLLA IDPALKDYSY AGESYDAVNL
IALAAEAAKS TKGTDIAKQL KAVSESGEKC NDFPSCVTLL RNGKDIDYDG QSGPVTFSDA
GDPTEAYIGI YEYQDDNTYK PSKEEFGKL