Gene Arth_3987 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3987 
Symbol 
ID4447752 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4503059 
End bp4504555 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content64% 
IMG OID639691818 
Producthypothetical protein 
Protein accessionYP_833462 
Protein GI116672529 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCGGGAA ACTTCTACGG TGCGGACGTC GCACAGCTAC GCCAGCTCGC GAAGGATCTG 
GCGAAAGGCG CAAGCCGGCT GGAGCAGCTG GGGCAACAGT TGGGCGGCTC CATCGCCTCC
AGCCCCTGGA AAGGCAACGA CGGCGAGCGT TTCCGCAGTG ACTGGAACGG CTCCCACGCG
AAGACGCTCA GGGCTGCGGC TGCGGGCATC CACAGCGCAT CCAAAGCCCT CCTGCAGAAC
GCGGAGCAGC AGGACCGGGC CAGCACCGGA TCAGCAGGCG GCGGCGCGGG CGGCCCCGGC
ACCGGCACCG GTGGCAGTTC TCCCACGGAC GGCGCCGCCC AAGAACTGAC GGACCGGCTC
AACGGCATGA CGCCTGAGGA ACGCGATGCG TACCTGAACA GTGACGAGTT TAGGAACTGG
GCGCTTGCCA ACCCCGACGC GGCAAAGGCC GCCCTCGACG CTGCTGCCGA CTCCGGGCTG
ATTAACAAGA ATTCGCCGGA GTATTCGGCC TTCCTCACTG GCTACTGGAA CCGGCAGGCC
ATGCGGGACA TGGGGATCGA CCCCGCCGAG TGGGATACTT CCAAAGGCAC CGAGTACAAC
TGGGAGACCA TCAAGAAGGT CTACGATTTC TACGGCCAGG CCTACCTGTC CAACCCGGAC
CTGCAGTGGG CGGGCATGGC CAACATGATC GGTCCCTCAT TTGCCGGCGG ATTCAAGGAC
ATGGCCATGA TGCGGGACCT CGCCCAGCAG ATCGCGGACA ACCCGGCGTC GGACATTCCG
CTTCCCATGC TGGATCAGAT CGAGCAGCTT GCGGGCATGA CTGATCAGGA GATCAAATTC
TACGAAACCA GCATGCTGGA CATGAACAAG GAGATCTTCC TTGACCAGGC CCGGCAGCAC
CAGGCTTATA TGACTGGCGG ACTGGACGAA ATCAACCGGC TCCGGGACTC CGGGGTGATC
GATCACCCGA CAGCACGCGC CTGGTCACAG ATCGACTCCG GCGACCCGGC CCAGATTCAG
GAAGGCAACA CTGCCCTCCT CTACCGGGAG CAAAACGAAA TCATCGCCGA CGACTACGAC
ACCATGCGCA GCCATCCCGG CGGCGAAGCC GTGACCTACA TGGTGACGTT GGCTGGCGAA
CCGTCCATCC CCGGCGCCAG GAGCTACCCC GAGGTGTTCC CCTACAAGTT CAGTGTGGAA
AGTCCGGGCC CGGAAAACGT GCCCTTCACC AACTGGGACA ATCCCGCCCA GTTCCGCACG
GACTTCACCA CCGGCTTCCC GGACGGCAAC ATCGCCAACG CCGACCAGCG CTGGGCCCTC
ATCCAGCAGG ACACCCTGCC GGCCTACCAG AACCTCCTGG CCACCGACCC CGCGGGAGCC
CGGCAAATCA TCGGCTCCGA CTTCAACGAC CGCGTGGACC AATACCGCCC CACCAATAAC
ATTCCGGGAA TCATGGACCG CTTCGTCTCC GGCTTCGACG CAGAGGTGCA CCAGTGA
 
Protein sequence
MAGNFYGADV AQLRQLAKDL AKGASRLEQL GQQLGGSIAS SPWKGNDGER FRSDWNGSHA 
KTLRAAAAGI HSASKALLQN AEQQDRASTG SAGGGAGGPG TGTGGSSPTD GAAQELTDRL
NGMTPEERDA YLNSDEFRNW ALANPDAAKA ALDAAADSGL INKNSPEYSA FLTGYWNRQA
MRDMGIDPAE WDTSKGTEYN WETIKKVYDF YGQAYLSNPD LQWAGMANMI GPSFAGGFKD
MAMMRDLAQQ IADNPASDIP LPMLDQIEQL AGMTDQEIKF YETSMLDMNK EIFLDQARQH
QAYMTGGLDE INRLRDSGVI DHPTARAWSQ IDSGDPAQIQ EGNTALLYRE QNEIIADDYD
TMRSHPGGEA VTYMVTLAGE PSIPGARSYP EVFPYKFSVE SPGPENVPFT NWDNPAQFRT
DFTTGFPDGN IANADQRWAL IQQDTLPAYQ NLLATDPAGA RQIIGSDFND RVDQYRPTNN
IPGIMDRFVS GFDAEVHQ