Gene Arth_0027 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0027 
Symbol 
ID4447531 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp33439 
End bp34536 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content63% 
IMG OID639687821 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_829528 
Protein GI116668595 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGCTT TCAGTTCGGA AAAAGGCGCC CTGCGCCCAT CGCGGCGGAT GGTGCTGGCA 
GGGTTTTCAT CGGCGGCGCT TGTGGCGCTG ACCGGCTGCA GGGGAGGATC AGGATCCGCT
GCCCCGGCTG CCTCTGGCAC GGGTTCAGGG GCGGCTGCCG ATTTCGGCAC ACTCGAGGTC
CAGTTGTCTT GGCTGAAGAA CGCGGAGTTC GCCGGCGAGT ACTTTGCCGA CAGCAAGGGC
TATTACAAGG ACGCTGGCTT CAGCGCCGTC AACCTGATCG CGGGCGGCCC GGGAGGGGCT
TCGGCGGAGA CCATGGTCCT CTCCGGGAAG GCCCTGGTGG GCACGTCGTC GCCCGTGGGT
GTGGCCCCGG TGGTGCTCAA CGAGGGAGCG CCGCTGAAGA TCATTGGTTC CACCTACCAG
AAGAACCCGT TCACCATCGT TTCGCTGGCC GCCAACTCGA TCACTGCACC GCAGGACCTC
GTGGGCAAGA AGATCGGCGT GCAGGCAGGC GTGAACGAAA CGCTGTTCGA TGCCCTCCTG
GAAGTCAACA AGATCGACCC CACGAAGGTC ACCAAGGTAC CGGTGCAATA CGATCCGCAG
CCGTTGCTGA ACGGCGACGT CGAGGGGTTC TTCGCGTACC TGACCAATGA AGTGCTCACC
CTTGAACTGG GCGGACACAA GACGGCGGTC CTGCCGTTTG CGGATAACGG CCTGCCCTTC
GTTGCCGAGA GCTTCGTGGT CACGGACGAG TCCATCAAGG ACAAGAGGCC GGAGTTGAAG
GCCTTCCTGG CGGCGACCAT CAAGGGCTGG AAAGACGCCC TGGCCGATCC CGATGAGTCT
GCACGGCTCG CCGTCGAGGT CTATGGCAAG GAGCTGGGCC TGACCATGGC CAAGGAAAAG
GGCCAGGCGG AAGCCCAGAA CACCAAGCTC ATCGCCACAC CGGAGACGGA AGGCAACGGC
CTGTTCACGG TATCCAAGGA GCTCATGGAC AAGAACATCG AGATCCTCAA GCTCGCCGGC
TACGACACCA CCGCGGACGC GATCTTTGAC CTGAGCCTCC TCGACGAGGT CTATGCCGAG
AACCCGGACC TGAAGTAG
 
Protein sequence
MTAFSSEKGA LRPSRRMVLA GFSSAALVAL TGCRGGSGSA APAASGTGSG AAADFGTLEV 
QLSWLKNAEF AGEYFADSKG YYKDAGFSAV NLIAGGPGGA SAETMVLSGK ALVGTSSPVG
VAPVVLNEGA PLKIIGSTYQ KNPFTIVSLA ANSITAPQDL VGKKIGVQAG VNETLFDALL
EVNKIDPTKV TKVPVQYDPQ PLLNGDVEGF FAYLTNEVLT LELGGHKTAV LPFADNGLPF
VAESFVVTDE SIKDKRPELK AFLAATIKGW KDALADPDES ARLAVEVYGK ELGLTMAKEK
GQAEAQNTKL IATPETEGNG LFTVSKELMD KNIEILKLAG YDTTADAIFD LSLLDEVYAE
NPDLK