Gene Arth_3780 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3780 
Symbol 
ID4447830 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4262548 
End bp4263738 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content67% 
IMG OID639691604 
ProductNLPA lipoprotein 
Protein accessionYP_833255 
Protein GI116672322 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCATCGC ATCAGAACCC AGAAGGATCC ACCCGCATCG TGGCGGGCCA AAGCAGCAAA 
GTCCAAAACT CCGCAGCAAA GCGCAAACGC GCCCTCGGAA TCGGCATCGC GGCCGGGCTC
GTCGCCCTGA TCGCCGGCGG CGCGGCAGTG GCGTCGAACC TCGCCCGCAG CACCGAATCA
CAGCCGGCCG CAGCAGCTGC CACCGGCACG CCTGCCGCGG AGTTGAAGCT TGGCTTCTTC
GGCAACGTCA CGCACGCCCC GGCGCTGGTG GGGGTGAAGG AAGGTTTCAT TGCCGGGAGC
CTGGGCGGGA CGAAGCTGAG CACGCAGGTG TTCAATTCCG GTCCGGCCGC GATCGAGGCG
CTGAACGCCG GCGCGATCGA CGCCACGTAT ATCGGCCCGA ACCCGGCGAT CAACTCCTTC
GTGAAGAGCC GGGGCGAGTC AGTGAGCATC ATTGCCGGCG CCGCGGCGGG CGGCGCCCAG
CTGGTGGTGA AGCCGGAGAT CGGCTCGGCC GCGGATCTGA GGGGCAAGAC CCTGTCTACG
CCGCAGCTGG GCGGGACCCA GGACGTGGCG CTGCGCGCCT GGCTCGCCGG GCAGGGGTAC
AAGACGAACA CGGACGGCAG CGGGGATGTG GCGATCAACC CGACCGAGAA CGCGCAGACG
CTGAAGCTGT TCCAGGACGG CAAGCTCGAC GGCGCGTGGC TGCCGGAACC GTGGGCGTCC
CGGCTGGTGC TGCAGGCCGG CGCGAAGGTC CTGGTGGACG AGAAGGATTT GTGGGACGGG
TCGCTGACGG GCAAGCCAGG CGAGTTCCCC ACCACCATCC TGATCGTGAA CAAGAAGTTC
GCCGCTGACC ACCCGGACAC CGTCAAGGCC CTGCTGAAGG GCCACGCCGA GTCCGTGGCC
TGGCTCAACT CCGCGGCCGC CGCCGAGAAG TCCACTGTCA TCAATGCCGC CCTCAAGGAA
GCGTCCGGCG CCGAGCTGAA AGCCGACGTC ATTGAACGGT CCCTGAAGAA CATCGTCTTC
ACCGTGGATC CGCTGGCCGG AACCTACAAA AAGCTGCTTG AGGACGGGGT GAAGGCCGGC
ACCACCAAGC AGGCGGACAT CACCGGCATC TTCGACCTCA CCGCCCTGAA CAGCGTCACC
GCCGAAACAG GCGGCAGTAA GGTCTCCGCC GCCGGACTCG GCACGGACTG A
 
Protein sequence
MSSHQNPEGS TRIVAGQSSK VQNSAAKRKR ALGIGIAAGL VALIAGGAAV ASNLARSTES 
QPAAAAATGT PAAELKLGFF GNVTHAPALV GVKEGFIAGS LGGTKLSTQV FNSGPAAIEA
LNAGAIDATY IGPNPAINSF VKSRGESVSI IAGAAAGGAQ LVVKPEIGSA ADLRGKTLST
PQLGGTQDVA LRAWLAGQGY KTNTDGSGDV AINPTENAQT LKLFQDGKLD GAWLPEPWAS
RLVLQAGAKV LVDEKDLWDG SLTGKPGEFP TTILIVNKKF AADHPDTVKA LLKGHAESVA
WLNSAAAAEK STVINAALKE ASGAELKADV IERSLKNIVF TVDPLAGTYK KLLEDGVKAG
TTKQADITGI FDLTALNSVT AETGGSKVSA AGLGTD