Gene Arth_2104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2104 
Symbol 
ID4445373 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2374862 
End bp2376112 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content67% 
IMG OID639689912 
ProductFeS assembly protein SufD 
Protein accessionYP_831584 
Protein GI116670651 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0719] ABC-type transport system involved in Fe-S cluster assembly, permease component 
TIGRFAM ID[TIGR01981] FeS assembly protein SufD, group 1 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGCCG AAATTACTAC TGAAAAGGCC CGCATCGGAG CGCCTTCCAT CGCAGGCTTC 
ACCGAGGAGG GGGAGGGGCT GGTCGCCGCC AAGGTGGACC ATAGCCACAA TCACGGCGTT
ACCGTGATGT CCTCCCGCGC CGAGCGCCTG ACGAGCTTCA ACGTTGCCGA CTTCCCGCTG
CCCACGGGCC GCGAGGAGGA GTGGCGCTTC AGCCCCGTCC GTGCCCTGGC CAACCTTCTC
TCCGACACCG CAACGGACCT CGAACCCGTG CCGGCCGCCG ACTTCACGGT CGAAGCGCCG
GAGGGCTACG TCCGCGGCAC GCTGGCTCCC GGAGCCGCGC CCCGCGGAAC CGTCCTGGTC
CCGGCTGACC GGGCCGCCGC CGTCGCGTCC GCCAACACGG ACGAGGCACT GCACGTGGTG
ATCCCGGCGG AAGCCGAGCC GGCCGAGCCG CTGCGGATCC TGGTCCACGG CGCAGCCGCG
GGCCGCCGCT CCAACGCACA CTATGTGCTG GAGGCAGGCG CCAACTCGCG CAGCGTTGTG
ATCCTGGAGC ACACCGGTGC CGCCGACCAC AACGGCAACC TGGAAGTCAT CGTCGGCGAG
GGCGCGCACC TCACCGTGAT CTCGGTGCAG CTCTGGGAAG ACGACGCCAG GCATCTGGCC
CAGCACGACG CACAGGTGGG CAAGGACGCT GTGTACAAGC ACATCGCCGT TTCCCTGGGC
GGCTCCGTCG TGCGCCTGAA CTCCAATGTG CGCTTCTCGG GCGAGGGTGC CGAGGCCGAG
CTGCTGGGCC TCTACTTCGC CGACGCGGGC CAGCACCTGG AGCACCGCTC CTTCGTTGAC
CACAACGTGC CCAACTGCAA GTCCAACGTG CTTTACAAGG GCGCCCTGCA GGGCAAGGAC
GCACACACTG TCTGGGTGGG CGACGTCCTG ATCCAGAAGC AGGCCGTGGG TACCGACTCC
TACGAGAAGA ACCAGAACCT GGTTCTGACG GACGGCTGCC GTGCCGACTC TGTTCCTAAT
CTTGAGATCG AAACGGGCCT TATCGAGGGT GCCGGCCATG CCAGCTCCAC CGGGCGTTTC
GACGACGAGC ACTTGTTCTA CCTCATGGCC CGCGGCATCC CCGAAGATGT TGCCCGCCGC
CTGGTTGTTC GCGGCTTCCT CAACGAGATC ATCCAGAAGA TCAAGGTTCC GGCACTCGAA
GAGCGCCTGA CTGACGCCGT TGAGCGCGAG CTCGCCGCGA GCGAGAACTG A
 
Protein sequence
MTAEITTEKA RIGAPSIAGF TEEGEGLVAA KVDHSHNHGV TVMSSRAERL TSFNVADFPL 
PTGREEEWRF SPVRALANLL SDTATDLEPV PAADFTVEAP EGYVRGTLAP GAAPRGTVLV
PADRAAAVAS ANTDEALHVV IPAEAEPAEP LRILVHGAAA GRRSNAHYVL EAGANSRSVV
ILEHTGAADH NGNLEVIVGE GAHLTVISVQ LWEDDARHLA QHDAQVGKDA VYKHIAVSLG
GSVVRLNSNV RFSGEGAEAE LLGLYFADAG QHLEHRSFVD HNVPNCKSNV LYKGALQGKD
AHTVWVGDVL IQKQAVGTDS YEKNQNLVLT DGCRADSVPN LEIETGLIEG AGHASSTGRF
DDEHLFYLMA RGIPEDVARR LVVRGFLNEI IQKIKVPALE ERLTDAVERE LAASEN