Gene CPS_4148 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPS_4148 
SymbolmutY 
ID3522464 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameColwellia psychrerythraea 34H 
KingdomBacteria 
Replicon accessionNC_003910 
Strand
Start bp4362430 
End bp4363518 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content39% 
IMG OID637286591 
ProductA/G-specific adenine glycosylase 
Protein accessionYP_270802 
Protein GI71281992 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.220216 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAATC CTATAGTAAT ATCAGCCAAA TCGGCTGAAC AATTTGGCCA GCAAGTCGTA 
AGCTGGTATC ACCTACAAGG TAGAAAGCAC TTACCTTGGC AACAAGATAA AACCCCATAT
AGAGTGTGGA TTTCAGAGAT AATGTTACAA CAAACACAAG TTGCGACAGT TATCCCTTAC
TATCAACGTT TTATGGAAAG TTTTCCGACA ATTACCGACT TAGCCAATGC TGATGAAGAT
GTGGTTTTAC ATCATTGGAC TGGTTTAGGC TATTATGCTC GAGCTCGTAA TTTACATAAA
TCAGCTAAAA TCATGCTCAA TGACTATGAT GGCCATTTTC CCATTGAAAT TGAGCAAGTT
ATCGCTTTAC CTGGCATAGG TCGCTCGACC GCTGGCGCTA TTTTAAGTTT ATCGTTAAAA
CAATATCATC CTATTTTAGA CGGTAATGTA AAACGGGTGC TGGCACGAAG TTACCTTGTT
GAAGGTTATA ATGGCTTAAG TAAATTCGAT AAAGCGTTAT GGCAATTAAG TGAGAAATTA
ACGCCTGCCA TTGAAACCGA TAGTTTTAAT CAAGCGATGA TGGATCTTGG GGCAACTGTG
TGTACTCGTA GTAAACCAAG CTGTGATATA TGCCCCGTTG AGCAAAGTTG CCTAGCCAAA
GCGGGTGATC AGCAAATGAA TTTTCCTCAG AAAAAACCTA AGAAAAAAAT TCCTGAAAAA
CAAACAATCA TGGTGATCCC AAGATTGAAA AACGAAAACT GCGATAAAGT TTTAATGGAA
AAGCGTCCTC CTGTTGGTAT TTGGGGCGGC TTATGGTGTT TTCATGAGGT TGATGAGCTA
AGCGAAATTA ATGACTTAAT GACGAGTTTG TCACTTAAGG AAATTTCATC ACAAACCCTA
ACTGAGTTTA GGCACACTTT CAGTCATTTT CATTTAGATA TTACTCCCGT GGTAGTAGAC
TGCCAGCAAC TTGAAGTTTC AAAAATAAAC GAACCTAATC AGCAAAAGTG GTATGATTTA
CACCAAGGAT TGAGTGTCGG CCTAGCGGCT TCCACACAAA AACTACTTAC TTTGCTTAGA
GACTGTTAA
 
Protein sequence
MNNPIVISAK SAEQFGQQVV SWYHLQGRKH LPWQQDKTPY RVWISEIMLQ QTQVATVIPY 
YQRFMESFPT ITDLANADED VVLHHWTGLG YYARARNLHK SAKIMLNDYD GHFPIEIEQV
IALPGIGRST AGAILSLSLK QYHPILDGNV KRVLARSYLV EGYNGLSKFD KALWQLSEKL
TPAIETDSFN QAMMDLGATV CTRSKPSCDI CPVEQSCLAK AGDQQMNFPQ KKPKKKIPEK
QTIMVIPRLK NENCDKVLME KRPPVGIWGG LWCFHEVDEL SEINDLMTSL SLKEISSQTL
TEFRHTFSHF HLDITPVVVD CQQLEVSKIN EPNQQKWYDL HQGLSVGLAA STQKLLTLLR
DC