Gene Arth_1073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1073 
Symbol 
ID4446444 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1159075 
End bp1160508 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content64% 
IMG OID639688879 
Productband 7 protein 
Protein accessionYP_830567 
Protein GI116669634 
COG category[S] Function unknown 
COG ID[COG2268] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.532983 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGGACT TGTCAGCATT CTTCCCGTTA ATTGCCGCTC TCATCGGGGC AATTGTTGTC 
ATCGGCTTTA TCTGGGTGGC AATAAAACTG ATGTGGAAAG TGGCTGAACC CAACGAGGCC
CTGATCATCT CCGGCCTGAC CCGCGGAACC CTTGAAACGC GGGCCGGAAT GGACTTCAAG
ATTGTCACGG GCAAAGGTGC GCTGGTGTTT CCCGGGCTTC AGACGGTACG GACCCTGTCC
CTCACACTGA ATGAAACTGA GCTCAAAGTT TCCTGTGTTA CCTCGCAGGG CATCCAGGTA
ATTGTGGAAG GTGTGGTGAT TTACAAGATC GGCGATGCCC CGCCCTTCAT TGCAAATGCG
GCCCGGCGTT TCCTGGGCCA GCAGCCCAAA ATGGAAAGCC AGGTGTACAA CGTCTTTGAA
GGGCACCTGA GGTCCATCAT CGGCAGCATG ACCATGGAGG AGATCATCCG CGAGCGGGAC
AAGCTCGGTT CGCAGGTCCG CAGCGCCAGC GGTGTGGAAA TGGAGAAGCT GGGCCTGGTG
GTGGATTCGC TCCAGATCAA GGACCTGCAG GACCCCACCG GCTATATCCA GAACATCGCC
AAGCCGCACA TCGCCCAGGT GAAGATGGAA GCCCGCATCG CCGAGGCCAC CAGGAACCGC
GAAGCGGCCG AGAAGGAAGC GGAGGCGGCG GCGCTCATCG CCGACGCTCA GAGCGTCTCC
GCCATCAGGC AGTCGGTGGC GCAGGCCAAT GCCGAACGGG CGAAGGCCAA CGCCGCCCAG
GCTGGGCCGC TCGCGGATGC GACGGCGCGG CAGCAGGTTG TGGTCCAGGA AACCGAGGTG
GCCAAGCTCG AGGCTGACCG CGAAGAGCAG AAGCTCCAGA CCACCATCCG CAAGCCCGCC
GACGCCAAGG CCTACGCCAA GCGCACGGAC GCCGAAGGCC AGAAGGCCGC GGACATCAGC
GCCGCGGAAG CGCTGGCCCG CCGCACCGAA CTAGAAGCCC AGGTCAACGC CCGGCGGACG
GAACTGCAGG CCCAGGCCAA TGCCACGGCT GCCGCGGCCG CGGCCGGCGC CACGAAGGTC
ACCGGCGAGG CGGAAGCCGC AGCCACCCGG GCGCGCGGCG ATGCCGCCGC ATCGGCCATC
AAGGCCAAGG CACTGGCGGA GGCGGAGGGC ATCAAGGCCC GCGCCGAGGC ACTCGGGACC
AACCAGGATG CCGTCATTTC TCAGCAGCTG GCCGAGAACA TGCCCGCTAT CATCGCCGCG
GCGGCTGAGC CGTTCTCGCA CGTGGGACAG ATGACTGTGC TCAACGGCGG GGAAGGCGTC
AACAAGATGC TGGGCGGGAT TCTGGCCCAG GTGGGCGACT ACCTTCCGGC GCTCTCTTCG
GCGCTGAAGA ACAGCAGGGA AGGCAAGCGG CCGGCGAAAG CCCCAGATGC GTAA
 
Protein sequence
MPDLSAFFPL IAALIGAIVV IGFIWVAIKL MWKVAEPNEA LIISGLTRGT LETRAGMDFK 
IVTGKGALVF PGLQTVRTLS LTLNETELKV SCVTSQGIQV IVEGVVIYKI GDAPPFIANA
ARRFLGQQPK MESQVYNVFE GHLRSIIGSM TMEEIIRERD KLGSQVRSAS GVEMEKLGLV
VDSLQIKDLQ DPTGYIQNIA KPHIAQVKME ARIAEATRNR EAAEKEAEAA ALIADAQSVS
AIRQSVAQAN AERAKANAAQ AGPLADATAR QQVVVQETEV AKLEADREEQ KLQTTIRKPA
DAKAYAKRTD AEGQKAADIS AAEALARRTE LEAQVNARRT ELQAQANATA AAAAAGATKV
TGEAEAAATR ARGDAAASAI KAKALAEAEG IKARAEALGT NQDAVISQQL AENMPAIIAA
AAEPFSHVGQ MTVLNGGEGV NKMLGGILAQ VGDYLPALSS ALKNSREGKR PAKAPDA