Gene Arth_1837 details

Gene Information

Locus tag: Arth_1837
Symbol:
ID: 4445631
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 2061732
End bp: 2063168
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table: 11
GC content: 64%
IMG OID: 639689655
Product: 3-beta hydroxysteroid dehydrogenase/isomerase
Protein accession: YP_831327
Protein GI: 116670394
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
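
The Start bp, End bp, Gene Length and Protein Length values above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 2061732
END_BP = 2063168


def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "1437 478"
```

Under these assumptions the computed values agree with the reported Gene Length (1437 bp) and Protein Length (478 aa).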


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAGCATAA CCAAACCACA CACCACGGTG CTTCTCACCG GGGCAACCGG CAACTGGGGG 
AGGGCGACCC TGCGCGAGCT CTCCTCGCGT TCGGACCGCG TCACCGTACT GGTGCTCTCC
TTACCAGGCG AAAAAGACAA GGCCGTACTG TCGGAGTTCT CGGCCATGGA GAACCTGGAT
GTTGTCTGGG GGGATCTGAC AGATTACGCC ACTGTTGCGA CGTGCGTAGC GCGGGCGGAT
GTGGTGCTCC ATGTGGGGGC GGTTGTTTCG CCTTTGGCCG ATGAGCAGCC TGAGCTGGCT
ACTCGTGTGA ACGTGGGCAG CATGCGGAAC ATCATCCGGG CGGTGAAGGC ACAGCCCGAT
CCCAGCCGGA TCAGGGTTGT CGGCGTCGGG TCGGTAGCGC AGACCGGGAA CCGCAACCCC
CCGCTCCATT GGGGCAGGGT CGGTGACCCA ATCCGCGTGT CCCGGTTCGA CGCCTATGGC
CAAAGTAAGG TGACAGCCGA GCGGGAACTT GTCGAGGCCG GCCTGCCGAC TTGGGTCTGG
CTGCGGCAGA CAGGAATCTT CCATCCCGGG ATGCTCGAAA TACGGGACCC CATCATGACC
CACTCGCCGT TCGCAGGAGT CATGGAATGG GTTTCGGCAC AAGACTCGGC CCGGCTGCTG
GCCAACCTCT GCGAACCGGA TGTCCCGGGC GAATTGTGGG GAGGTGTCTA CAACATCGGC
GGGGGCGAGG GCTGGCGGCT CAGCAACTGG CAACTGCAGA CGGCCATCGG CCAAGCTGTG
GGCGTGAAGG ACATCAGGAA GTGGTACGAC CGGAATTGGT TCGCGCTGAA GAACTTCCAT
GGACAGTGGT ACACCGACAG CGACCGGCTG CACGCCCTGG TCCCGTTCCG CCAGGACACG
TTCGAAAGTG CCCTCGCCCG CGCCCTCGCC ACGGCCCCCT CGTCAGTACG AAATGCCGGC
AAGGTCCCGG CCTGGATCGT CAAACACCTC GTCATGAAGC CGCTGTCCCG CAAACCCCGG
GGAACGATGG CAGCCATCAG GTCAGGAACC GACCAGGAGG TCAGCGCCCA TTTCGGCAGC
CTGCCGGAAT GGCGCAGCAT CGGCGACTGG TCCACGTTCG AGCCGCCCGC ACCCTCACGC
ACCCCGTCCT ATCTCGACCA CGGATATGAC GAGAACAAGC CTGCATCCGA GTGGTCCGCC
ATCGATTACC TGGAGGCAGC AGCCTTCCGA GGCGGCAGGC TTTTGACCGA AGACGTGAAC
CCTGGGCTTC CGTCGGCACC GCTCATGTGG TCCTGTGGGG CGGGCCATGA ATTCGCTGCC
AGCCCAAGGC TGGTGCTTCA GGCTGGCCAC TGGTGCCCCG CATGTACCGC CGATCCCGCA
GGCTACGACC GGCAAGCCGA GCACAACAAA TTCCTCGCTC AGGTCATCGA TGCATGA
 
Protein sequence
MSITKPHTTV LLTGATGNWG RATLRELSSR SDRVTVLVLS LPGEKDKAVL SEFSAMENLD 
VVWGDLTDYA TVATCVARAD VVLHVGAVVS PLADEQPELA TRVNVGSMRN IIRAVKAQPD
PSRIRVVGVG SVAQTGNRNP PLHWGRVGDP IRVSRFDAYG QSKVTAEREL VEAGLPTWVW
LRQTGIFHPG MLEIRDPIMT HSPFAGVMEW VSAQDSARLL ANLCEPDVPG ELWGGVYNIG
GGEGWRLSNW QLQTAIGQAV GVKDIRKWYD RNWFALKNFH GQWYTDSDRL HALVPFRQDT
FESALARALA TAPSSVRNAG KVPAWIVKHL VMKPLSRKPR GTMAAIRSGT DQEVSAHFGS
LPEWRSIGDW STFEPPAPSR TPSYLDHGYD ENKPASEWSA IDYLEAAAFR GGRLLTEDVN
PGLPSAPLMW SCGAGHEFAA SPRLVLQAGH WCPACTADPA GYDRQAEHNK FLAQVIDA