Gene Arth_4355 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4355 
Symbol 
ID4443466 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008538 
Strand
Start bp94908 
End bp95912 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content58% 
IMG OID639687676 
ProductABC-type nitrate/sulfonate/bicarbonate transport systems periplasmic components 
Protein accessionYP_829373 
Protein GI116662319 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000418029 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTTTG AACCAATGGG ACGCCGCGGA TTTATCAAGA TCACAGCGGC AGCGGGCGGG 
ACGTTATTTC TGGGAGGCCT CACGGCGTGC GGCCAGGACA GCGGCGGCAG CGGCGGGGGT
AAGAAGGGAT ATTCCGGCGA CGTTGCTATT ACTGGTCTGG CCAGTCTTAT TCATTCCGCC
CCCTTCTTTA TCGCCCAGAC GGAAGGCTAC TACGAGGAAG AGGGGCTGAC CCTTGAGCAT
ATTCAGTTCC CAGGGGGGCT TGACACTGTT CGAGGAATCG AATCCGGCAT AGGTTTCGGT
ACGTCGTCCA CTATTCCCGT CTTTATCGCG GCCGAGAAGG GCATGGACGT AAAAATCTTC
GGCAATGTTT ATACGGCGGC TTCTGTCGAC TTCATTGCGC TGCCCGACTC TCCCGTCACC
TCCATTGAGG ATGTCAGGGG CAAGAAAGTG GCAGTCAGCA CGCCAGGATC AAACTCCTCG
TATTTCGCAG ACCGCACGCT GCGGGCCGCT GGCTTAGTCC CAGGCAAGGA CGTTGAACTT
ATCAGCGTCG GCTCAGCGAG CGATTCGTGG ACCGCCGTCT CCCAGAAAGT TGTTGACGTA
GCGTGGACGG CTTCGCCGCT TTCCGAGAAA ATCGCCTCCG AAAGCGGAGC GAAGGTGATC
TGGCGTTCCC GCGACTACGT AACGGACTGG TCGGACACCT GCCTCGTCGC GACAGGATCC
TTCATCGACG AAAATACCGA GGCGTTGAAG GGCTGGGGCC GTGCGCTCAA GAAGGCGATG
GACCTGATTA CCAACGACCT CGAAAAGGCC GCCGACGCCT ACGGGAAGGC AATCAAATAC
GAGCCCAAGG TGGCTCTTGA GGCGCTGAAG AACTCGCAGA ACTTCTACAG CCTGGACTTC
ACCGATGCCC AGCTGGCCGC CGTCGTCGCC GCCGGTAAAG AACAAGGCCA GATCACCAAA
GAACCGGACA TGAACGCCAT CGTTATGAGG AACTTCCTTT CATGA
 
Protein sequence
MSFEPMGRRG FIKITAAAGG TLFLGGLTAC GQDSGGSGGG KKGYSGDVAI TGLASLIHSA 
PFFIAQTEGY YEEEGLTLEH IQFPGGLDTV RGIESGIGFG TSSTIPVFIA AEKGMDVKIF
GNVYTAASVD FIALPDSPVT SIEDVRGKKV AVSTPGSNSS YFADRTLRAA GLVPGKDVEL
ISVGSASDSW TAVSQKVVDV AWTASPLSEK IASESGAKVI WRSRDYVTDW SDTCLVATGS
FIDENTEALK GWGRALKKAM DLITNDLEKA ADAYGKAIKY EPKVALEALK NSQNFYSLDF
TDAQLAAVVA AGKEQGQITK EPDMNAIVMR NFLS