Gene Arth_3033 details


Gene Information

Locus tag: Arth_3033
Symbol:
ID: 4444400
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 3399395
End bp: 3400426
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 65%
IMG OID: 639690857
Product: hypothetical protein
Protein accession: YP_832512
Protein GI: 116671579
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID:
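
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the gene record above.
start_bp = 3399395
end_bp = 3400426

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1032
print(protein_length)   # 343
```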


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.233167
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACGAC ACCTGACCAT CATGAGTGCC GCGGCCGCCG TCGTCATCGC GATGACGGTG 
TCGGGCTGCG GCGGCGGCGC CGCAGGGGCA ACCTCGGCCG GCGGAAGTGC CGGCGGCGCC
ACCGAGGTGA AGGAGCTCCG CTACCAGGGC TGGGCCAACA CGGTAACGCT GCCGGAACTT
GCCCAGGACC TCGGCTACTT CGGCGACGTC AAGCTCAACT GGGTGGGCAA CACCATCAGC
GGCCCGCAGG ACATCCAGTC CGCGGCCACC GGGCAGACGG ATTTTGGTGG CGCGTTCGCC
GGAGCGGTGG TGAAGCTGGT GGAAGCCGGC GCCCCGGTCA AGGCCGTCAT CAACTACTAC
GGCGAAGACG AGAAGACCTT CAACGGCTTC TACGTCAAGG AAGACAGTCC CATCCGCACG
GCCCGGGACT TCATCGGCAA GAAGATCGCA GTGAACACCC TCGGAGCACA CGCGGACGCC
GTCATCAACA CCTACCTGCA GAAGAACGGT CTGAGCGCCG AGGAAATCAA GCAGGTGCAG
CTGGTGGTGG TGCCGCCCAA CGACACCGAG GAGGCCATCC GCCGCGGCCA GGTGGATGCC
GGTTCGCTGG GCAGCATCCT GCAGGACAGG GCGATCGCAA ACGGCGGCCT GCGGTCGGTG
TTCAGTGACG CGGAACTTTT CGGCACCTTC GCCGGCGGCC CCTACGTGCT GCGCACCGAC
TTCATCGCGA AGAACCCAAA CACCACCCGC ACATTCACCA CCGGGGTGGC CAAGGCCATC
GAATGGGAGC GGACCACGCC CCGCGAGGAA GTGATCGCCC GCTTTACCAG GATCCTGCAG
GAACGCGGCC GCAACGAGAA CCCGGCAGCG CTGCAGTACT GGAAGAGCGT GGGCGTACCC
GCCAAGGGCG AGATCAAGGA TGAGGATTTC ACCCGCTGGG GCAAGTGGCT CAAGGACACC
GGAATCATCA AGGGCGAACT GGACCCGAAG AAGCTCTACA CCAACGAGTT CAACGCCCTG
GTGACCGGAT GA
 
Protein sequence
MKRHLTIMSA AAAVVIAMTV SGCGGGAAGA TSAGGSAGGA TEVKELRYQG WANTVTLPEL 
AQDLGYFGDV KLNWVGNTIS GPQDIQSAAT GQTDFGGAFA GAVVKLVEAG APVKAVINYY
GEDEKTFNGF YVKEDSPIRT ARDFIGKKIA VNTLGAHADA VINTYLQKNG LSAEEIKQVQ
LVVVPPNDTE EAIRRGQVDA GSLGSILQDR AIANGGLRSV FSDAELFGTF AGGPYVLRTD
FIAKNPNTTR TFTTGVAKAI EWERTTPREE VIARFTRILQ ERGRNENPAA LQYWKSVGVP
AKGEIKDEDF TRWGKWLKDT GIIKGELDPK KLYTNEFNAL VTG
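
The gene and protein sequences above can be related with a short script. The sketch below uses plain Python with no third-party libraries. It strips the 10-character block spacing, computes a GC fraction, and translates codons using the standard amino acid assignments. Translation table 11 shares these assignments with the standard table and differs only in the permitted start codons. The function names are illustrative and not part of any database tool, and only the first and last fragments of the gene sequence are used here for brevity.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, listed in TCAG order.
# Translation table 11 uses the same assignments; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
head = "ATGAAACGAC ACCTGACCAT CATGAGTGCC"
tail = "GTGACCGGAT GA"

print(translate(head))   # MKRHLTIMSA
print(translate(tail))   # VTG
print(gc_content(head))  # 0.5
```

Passing the full gene sequence to `translate` and `gc_content` in the same way would allow a comparison against the protein sequence and the GC content reported in the record.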