Gene Arth_1921 details


Gene Information

Locus tag: Arth_1921
Symbol:
ID: 4445540
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 2163115
End bp: 2164179
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 63%
IMG OID: 639689731
Product: inner-membrane translocator
Protein accession: YP_831403
Protein GI: 116670470
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1172] Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components
TIGRFAM ID:
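
The Start bp, End bp, Gene Length, and Protein Length fields above are mutually consistent, and a minimal sketch can check this. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2163115, 2164179)
print(length, protein_length_aa(length))  # 1065 354
```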


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.748893
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCCGGA CCTTGTCGTC CCTGCCGCCA AAGAGCGGCG CGCCCTCCCC ATCGCCCTCA 
GCAAGTATCA GCTCCCGACT GCGGGGCAGC GCCCTCCGCG CCCTGCCAAA GAGCTACCTT
ATCCTCGTGC TTCTCGCGAT CATCGCGGTG GGCTACTACG TCTCCGACGA CTTCTTGACG
TTCCGCAATG CCGAGAACGT GATCACGGCC GCCTCGATCG TCGTCGTGCT GGCGATCGGC
CAGTATTTTG TCATCCTGAC GGGCGGAATT GACCTCTCCG TAGGCTCGAT CCTCGCCATG
TCCACGGTAA TCACGGCACT GACCTTGCAG GCCGGGATGC CCGCCGGCGC GTCCGTTGTG
TTCGTGCTTG CCTGCTGCGC TGCAGCCGGA CTGATCAACG GCATCCTCGT CGTTTGGTTG
AACATCCCCC CGTTCATCGC CACGCTCGCC ATGATGAGCG CCGTCAAGGG CTTCAGCTAC
ATCATCCAGT CAACAAGCCT GATCGAGATC CGCGACCAGT GGATCATTGA GACCTTCTCC
CGCGGCAGCT TCCTCGGGAT CCGCCACCCC GTCCTGATCT TCATTGTCGT GGCCGTGGCC
GCGGCGCTCG TGGCCAAATA CACTACGTTC GGCCGCTCGC TCTACGCTAT TGGCGGCAAC
CCTGAAGCCT CGCGGCTTTC GGGCCTGCCG GTCGCACGGA ACCTGATCAT CACCTACACC
ATCTCCGGGG TGCTCGCCGG CCTGGCAGGC CTCATCGCCG CCTCCCAGCT GAGGCAGGGT
AGCTCGCTCA TCGGTGTAGG TTACGAGCTC GACGCCATCG CGGCCGTCGT CGTGGGCGGC
GCCTCCCTCA TGGGCGGAAA GGGTGATCCC CTGAACGCAG TGATCGGGGT GTTCATCCTG
ACCACCATCA TCAACATCAT GAACCTCGTC GGGATCTCCT CCGAGCCGCA GCTGGTCATC
AAGGGCGCGG TCATCATCCT CGCCGTCTTC CTCTCCAGCG CAGGCGGCGT CCAGAGGATC
TCCGGATTCT TCTCGAAACA TTTCCCGCGC ACCCGCGCCG TCTGA
 
Protein sequence
MSRTLSSLPP KSGAPSPSPS ASISSRLRGS ALRALPKSYL ILVLLAIIAV GYYVSDDFLT 
FRNAENVITA ASIVVVLAIG QYFVILTGGI DLSVGSILAM STVITALTLQ AGMPAGASVV
FVLACCAAAG LINGILVVWL NIPPFIATLA MMSAVKGFSY IIQSTSLIEI RDQWIIETFS
RGSFLGIRHP VLIFIVVAVA AALVAKYTTF GRSLYAIGGN PEASRLSGLP VARNLIITYT
ISGVLAGLAG LIAASQLRQG SSLIGVGYEL DAIAAVVVGG ASLMGGKGDP LNAVIGVFIL
TTIINIMNLV GISSEPQLVI KGAVIILAVF LSSAGGVQRI SGFFSKHFPR TRAV
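
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and computing its GC fraction. The following is a minimal sketch rather than the tool that produced this record. It assumes the standard codon assignments, which translation table 11 shares (the tables differ in which start codons are permitted), and it strips the spaces and line breaks used in the listing. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons built from BASES ('*' marks a stop codon).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        residue = CODON_TABLE[s[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence listed above.
first_line = "ATGTCCCGGA CCTTGTCGTC CCTGCCGCCA AAGAGCGGCG CGCCCTCCCC ATCGCCCTCA"
print(translate(first_line))  # MSRTLSSLPPKSGAPSPSPS
```

Passing the full gene sequence to `translate` and `gc_fraction` allows a comparison with the listed protein sequence and the reported GC content.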