Gene Arth_3083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3083 
Symbol 
ID4444316 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3453965 
End bp3454951 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content62% 
IMG OID639690910 
ProductABC-type nitrate/sulfonate/bicarbonate transport systems periplasmic components-like protein 
Protein accessionYP_832562 
Protein GI116671629 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.261971 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACGTC GATTTATTGC CGTTCTGGCA GCCGCGTCTG TTCTGGGATT GTCGGCCTGC 
GGAAGTGGCT CGCCCAGTTC CACGGGAGGC GGAACTGCCT CGGGCACCGC CGGTGGAGGT
TCGGACCTGA CGAAGGTCAG CGTCGGTGTC ATTCCCATCG TCGACTGCGC CCCCATCTAC
CTCGGAGACA AGCAGGGCTT CTTCAAGGAA GAGGGCCTCC AGCTGGACAT CCAGACCGCA
ACCGGCGGTG CGGCGATCGT CCCCGGTGTC GTCAGTGGCA GTTTCGACTT CGCGTTCTCA
AATCTGATCT CGGTGATGGT CGCCAAGGAC AAGGGCCTTG ACCTGAAATT CGTCGCCAAC
GGTGCGTCCA CCACCGGGGA AAAGGGCAAG GACATCGGCG GCGTCGTCGT GCCGGCAGGC
TCAAGCATCC AGTCCGGGAA AGACCTCGCA GGCAAGACCG TTTCGGTGAA CAACCTCTCC
AACATCGGCG ACACCACCAT CAAGTCGGTC GTCGAAAAGG ACGGTGGTGA CCCCAAGAGC
GTGAAGTTCG TCGAGGTGGC CTTCCCGGAC GCCCCGGCGG CCCTGGCCAA CAAGCAGGTG
GATGCGGCGT GGATCCTTGA GCCCTTCCTG TCCAAGGCCG TGGCTGAAGG CGGCAAAGTG
GTTTCCTGGA ACTTCGTCGA GATGAGCCCG GAGCTGGACA TCGCCGGCTA CTTCACCAAG
GGAGACACCA TCAAGGGCAA GGCTGAGCTC ACGCAGAAGT TCACCCGTGC CATGAACAAA
TCGCTTGAAT ATGCGCAGCA GCACCCGCAG GAGGTCCGCG ACATCGTGGG CACCTACACG
AAGATCGACG AGGCTGCCCG GGCCAAGATC GTGCTGCCGC GGTACCGGGT CGACTTCAAC
AAGGATGCGT TCAAGACCCT CGGCGACGCC GCCGCCAGCT ACGGCACGCT GACCAAGGCT
CCGAACGCAG ACGAACTCCT CCCGTGA
 
Protein sequence
MKRRFIAVLA AASVLGLSAC GSGSPSSTGG GTASGTAGGG SDLTKVSVGV IPIVDCAPIY 
LGDKQGFFKE EGLQLDIQTA TGGAAIVPGV VSGSFDFAFS NLISVMVAKD KGLDLKFVAN
GASTTGEKGK DIGGVVVPAG SSIQSGKDLA GKTVSVNNLS NIGDTTIKSV VEKDGGDPKS
VKFVEVAFPD APAALANKQV DAAWILEPFL SKAVAEGGKV VSWNFVEMSP ELDIAGYFTK
GDTIKGKAEL TQKFTRAMNK SLEYAQQHPQ EVRDIVGTYT KIDEAARAKI VLPRYRVDFN
KDAFKTLGDA AASYGTLTKA PNADELLP