Gene Arth_0468 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0468 
Symbol 
ID4447058 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp496333 
End bp499605 
Gene Length3273 bp 
Protein Length1090 aa 
Translation table11 
GC content63% 
IMG OID639688266 
ProductYD repeat-containing protein 
Protein accessionYP_829968 
Protein GI116669035 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTTTGT CCTTTGTTGT TGTGACATCA GTGGCTTTGA CAGGGATGGG CGCACCTGCT 
GTGGCCGCCC TTGACGCGTC AGTACTGACC GCCCCGGCGG CTCCATCACC TCCGTCCAGT
CCAGTAACGG AGGCGAAACT GGAACCGGCG CCTGAGCCTG TCCAGGTCGA GCCAGAACCG
GGGGATGAAG TACCTTCCTC TGATGCTGAA GGTACGCCCA TCATCCCGGA GGAAGCGCCG
GCGGCTTTGG CTGTCGCCGC CTTGGCCCCG GCAACCGGAG CGAATGGCGT CGCGTCAGGC
AACCGTGCCG CTGCGTCCAT GGTCACCCGC CAGCTTTCGG ATGCCGCCGC ACTTTCCTGG
AATCCGACGA ACGGGAACAT CGTGCTGACC GGGGATCTGC TGCATCTGAA AGGCGTCACC
CGGGACATGG ACGTGAAATG GCGCTACAAC AGCCTCAATG ACGTCCGCCC AACGCTTTCG
GTCGGTACCC GTGAAGCTGG CGTCAGGGTC GATGCGTCCA ACAACATCAC GTACACCGCA
GCGGACGGCG GGGAGTACAC CTTCGTTCCC GACGGAGCCG GAGGATGGAC CCGGCCCGCA
GGCCTGAATG CCTCTATCCG GTCGCTTTCC GCAACCAACC TCAACATCAA TTTCGATGAC
ACCGGATATG GCAATGGATA CGTCAAGAGC GGGGACGTGT ACGTCCTCCG AAGCGAGGAC
GACCACTACG GAGACCTTCC CAACCGGGCC TCGTACTACG ACACCGACGG CCGACTCAGT
GCCAGCTCGG ATAACCGGAA CCAGATCATC AACTACGTCT ACCAGGACGC GAATAACGAG
GAACAGCCCT CCCAGATCAT CGACACCACG ATCAACCGGG CGATCAACAT CGAGTACAAC
GGCGGATCCG GACGGATGTC CAAGATCACC AACGCCTCCG GCGCGGCCCT GTCCTTCACC
TACAACTCCG CCGGCAAGAT TGCCACCGTC AAAGACGGCC GCGGCACCAC CACCACCCTG
GAGTACGACA CCAATGGCAG GGCAAAGAAA ATAACCTACG GCACCGGCAC CACCGCCCAA
AGCATCTTCA CCCCGGCTTA CCCCACCGCG ACATCCTCGA CGCTGACGGA CCCGAACAAT
AAAACAGCGA CCTACGCTTT CACAGGAGCC CGGCAGGTCA CCCAGGTCAC CGATCCGAAC
GGGAACACCA CCCAAGCCAC CTGGGACGCC CACGACAACC GCCTGACCTC AGTGGACGGG
CTGGCCAAGA CAACCACCGC CACGTACAAC CTCAATAACT CCCTGACCAA GATCACCAAG
CCCGCAGCCG GGACCGGGAC AGGAAGCGAA ACGATCTACA CCTATGCCGG CCCGAACGGC
CGCCTACCGA TCGCGTCCAC AAACAGCGAA GGACACGTCA CCACCTATGG CTACGACGTG
AACACTGCCA ACATCGACAA GGCACAGACA CCAGGGCCCA ACGGCGGCGC AGGCGGCGCC
CGAACCTGGA ACTTTGAGCG CGATGACGAA CTAACCACCT GCGGAGCTCA GCGCGGTCAG
CTGTGCAAAA CCACGGACGG CAACGGCAAC GTCACCAGCT ACGCCTACGA CGCAAACCGC
AACCCCGTCA CGATCACCCG CCCAGCACCC CTTGGTGTCA TCACCAACAC CTTCGACGCC
GCAGACCGGC TGATCACGTC CAAAGATGGC AAGAACCAGA CCCAGACCCA CACCTACGAC
AACAATGACC GGATCACCCA GACCCGCCAG GGCGCCACCT GCATCCCCGC GACCTGCGTC
ACCTACACCT ACGACGCGAA CGGCAACCTC ACCCAGCGCG TCGACGCAGG GGGCACCACC
ACCATCACTT ACGACGCCCA GAATAGGCCA ACGACTAAGA CCATCGGCGG CACCACCACG
ACGCTCACCT ACGACGGGGC CTCCAACATC CTGACCAGTG TGGACCCGCT CGGGACCGTG
ACCTATAAAT ACGACGCCGG GAACCGGCTG ATCTCCCTCG CCGAGCCCGG CGGATCCTGC
CCCGCAACCC CGGTATTTCC GAACAGCACC AAATGCACGG GCTTCGAATA CGACGCCAAC
AACAACCGCA CCGCCACGAA ATACCCCAAC GGGATGAAGA ACACCACGGT CATCGACGCG
GCCGGGCGGA CCACCTCCAT CACCGCGACC AACACCACCG CGGGCGTCCT GGCCAGACGC
GCCTACACCT ACACCGTCAA CGGCACCAAA GACGGTGCCC TGCGCAAAAC CGTCACCGAC
CATGCAGGCA CCGTCACCAC CTACAACTAC GACGAGGTCA ACCGGCTCAC CCAGGCCGTG
ACCGGGACCA ACACCGAAAC CTGGGCCTAC GACAAGAACA GCAACCGGAC CGTCGATACC
AAGACCGGGA CCGCGAACGT GTACAACGCC TACAACGGAG CCGACCAGCT CTGCTGGGTA
GGGGCCAGCG CCGGGACCTG CGCCTCACCC CCGGCAGGGG CCGTCACCTA CGCCTACGAC
GCCAACGGCA ACACCACCAC CGCCGGGGCA ACCACCCAGA CCTACAACGT CTTTGACCAG
TTCACCTCCA ACACCAACGG CGGCACCACC AACTACGCCT ACGCAGGCAC CCGCAACGAC
GAACGCACCA CCGCCGACGG AACCGCGTTC CTCAACGGTG CACTGGGCAT CACCCGCCAA
ACCACCGGCG GGGCAGCCAC GTCCTTCATC CGTGACCCGG ACGGTAACCT CGTCAGCATG
CGCACCAGCA CCGGAGCCAG CTACTACTAC ACCACCGACG CCCTCGGCTC GGTCATCCTC
CTCACTGACA GCGCCCAGAC GAAAGCCGCC GAATACGCCT ACGACTCCTG GGGACTGACC
ACCACCAACA GCGGCGCCCA GGCAGCTGTG AACCCGTGGA CCTACGCCGG CGGGTACAAC
GACACCACCA GCAACCGCAT CAAATTCGGC GCCCGCTACT ACAACCCCTG GCGCGGACGC
TTCACCCAAC CCGACCCATC AGGCCAAGAC CAAAACCGTT ACGCCTACGT AAGCTGCAAC
CCAATCAACG CCACAGACCC AACGGGCCTC GGCCCTGCCG AATGCTTTTT CAACGCTTCG
GCCGCTGTAT TTTCCGCTTT CGTTGTTGCG GGGGCAGGCG CCGCCGCTGT CGCAACTGGC
GGTTTGGCAG TCGTTGGCTT GATCGGGGCG ATTGGACTTG AGGCTACTAC TGCCACTGCC
GCTGGCTACT ACTGTGGAGA GCTGATCAAA TGA
 
Protein sequence
MALSFVVVTS VALTGMGAPA VAALDASVLT APAAPSPPSS PVTEAKLEPA PEPVQVEPEP 
GDEVPSSDAE GTPIIPEEAP AALAVAALAP ATGANGVASG NRAAASMVTR QLSDAAALSW
NPTNGNIVLT GDLLHLKGVT RDMDVKWRYN SLNDVRPTLS VGTREAGVRV DASNNITYTA
ADGGEYTFVP DGAGGWTRPA GLNASIRSLS ATNLNINFDD TGYGNGYVKS GDVYVLRSED
DHYGDLPNRA SYYDTDGRLS ASSDNRNQII NYVYQDANNE EQPSQIIDTT INRAINIEYN
GGSGRMSKIT NASGAALSFT YNSAGKIATV KDGRGTTTTL EYDTNGRAKK ITYGTGTTAQ
SIFTPAYPTA TSSTLTDPNN KTATYAFTGA RQVTQVTDPN GNTTQATWDA HDNRLTSVDG
LAKTTTATYN LNNSLTKITK PAAGTGTGSE TIYTYAGPNG RLPIASTNSE GHVTTYGYDV
NTANIDKAQT PGPNGGAGGA RTWNFERDDE LTTCGAQRGQ LCKTTDGNGN VTSYAYDANR
NPVTITRPAP LGVITNTFDA ADRLITSKDG KNQTQTHTYD NNDRITQTRQ GATCIPATCV
TYTYDANGNL TQRVDAGGTT TITYDAQNRP TTKTIGGTTT TLTYDGASNI LTSVDPLGTV
TYKYDAGNRL ISLAEPGGSC PATPVFPNST KCTGFEYDAN NNRTATKYPN GMKNTTVIDA
AGRTTSITAT NTTAGVLARR AYTYTVNGTK DGALRKTVTD HAGTVTTYNY DEVNRLTQAV
TGTNTETWAY DKNSNRTVDT KTGTANVYNA YNGADQLCWV GASAGTCASP PAGAVTYAYD
ANGNTTTAGA TTQTYNVFDQ FTSNTNGGTT NYAYAGTRND ERTTADGTAF LNGALGITRQ
TTGGAATSFI RDPDGNLVSM RTSTGASYYY TTDALGSVIL LTDSAQTKAA EYAYDSWGLT
TTNSGAQAAV NPWTYAGGYN DTTSNRIKFG ARYYNPWRGR FTQPDPSGQD QNRYAYVSCN
PINATDPTGL GPAECFFNAS AAVFSAFVVA GAGAAAVATG GLAVVGLIGA IGLEATTATA
AGYYCGELIK