Gene Haur_4182 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4182 
Symbol 
ID5736044 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5334020 
End bp5335324 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content55% 
IMG OID641281337 
Productpyrimidine-nucleoside phosphorylase 
Protein accessionYP_001546942 
Protein GI159900695 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.953072 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTGCGG TTGATCTGAT TATTAAAAAA CGCAATGGAG CACAGCTCTC AACCGAAGAA 
ATTCAATGGC TGATTCAGGG CTATACCAAC GGCAGTGTGC CCGATTATCA GATGGCGGCG
TGGGCCATGG CGGTTGTGCT CAAAGGCATG GATGATCGCG AAACCACCGA TTTAACCCTT
GCCATGGCCG CTTCGGGCGA TCAGCTCGAT TTGCGTGATT TCGCGCCCGA TGCCGTTGAT
AAGCACTCGA CTGGCGGGGT TGGCGATAAA ACTAGCCTTG TGCTGGGGCC AATGTTGGCA
GCCGTTGGTT TGCAAGTTGC CAAAATGTCG GGGCGGGGCT TGGGCTTTTC GGGCGGCACG
CTCGATAAAC TTGAGGCCAT CCCCAACATG CGCATCGACC TGAGCGAAGA TGAGTTTCGC
CATGCCATGC GTGAGATTGG CATGGTGATT ATGGGCCAAA CTGCTGATCT CGCACCTGCC
GACAAAAAGC TGTATGCGTT GCGCGATGTG ACTGGCACTG TCGAATGTAT TCCGCTGATT
GCAGCCAGCA TTATGAGCAA AAAGCTGGCG GCTGGAGCCA AAAGCATCGT ACTCGATGTC
AAGGTTGGGG CGGGGGCGTT TATGAAAACC CTCGATCAAG CCCGTGATTT GGCCCGAACG
ATGGTGCGGA TCGGCCAATT GGCTGGCCGC AATGTCGCTG CGATCCTTTC GTCGATGGAG
CAACCGTTGG GCTTGACGAT CGGTAATGCG CTGGAAGTGC GCGAAGCGAT TGAAACGCTC
CAAGGTCGCG GCCCCGGCGA TTTGGTTGAA GTTTGTTTGA CCCTTGGCTC ACATCTGTTG
GTTTTGGCTG GCAAGGCCCA AAATCTTGAT GATGCCCGCC AACAGTTGCA AGCAAGCTTG
GATAACGGCC AAGCTTGGGC TAAATTCCGT GAGTTTGTTG CCCAGCAAGG CGGCGATCTC
ACGGTGATTG ACCAACCAGA AACCCTGCCA ATCGCCCCAA TTCAAATCAG TTTGCTGGCC
GAGAGCAGCG GCTTTGTCCA ACGCATCGAT GCCGAAACCT GTGGGATTGT GGCGACCGAG
CTTGGAGCTG GTCGCGCCCG CAAAGAAGAT GCGATCGATC CGGCGGTGGG CTTGGTGCTT
GAGCGTAAAG TTGGCGAGCC AGTTCAAGCG GGCGAGGCCT TGTTGACAGT GCATGCTGCT
GATCAGCAAC GGGCCGAGGT TGCTTTGGCC GCACTCAAAT CGGCAATTAC GATCAGTGCT
ACGTCGGTTG AAGCCTTGCC CTTGGTTTTC GAAAGCGTTG CCTAG
 
Protein sequence
MRAVDLIIKK RNGAQLSTEE IQWLIQGYTN GSVPDYQMAA WAMAVVLKGM DDRETTDLTL 
AMAASGDQLD LRDFAPDAVD KHSTGGVGDK TSLVLGPMLA AVGLQVAKMS GRGLGFSGGT
LDKLEAIPNM RIDLSEDEFR HAMREIGMVI MGQTADLAPA DKKLYALRDV TGTVECIPLI
AASIMSKKLA AGAKSIVLDV KVGAGAFMKT LDQARDLART MVRIGQLAGR NVAAILSSME
QPLGLTIGNA LEVREAIETL QGRGPGDLVE VCLTLGSHLL VLAGKAQNLD DARQQLQASL
DNGQAWAKFR EFVAQQGGDL TVIDQPETLP IAPIQISLLA ESSGFVQRID AETCGIVATE
LGAGRARKED AIDPAVGLVL ERKVGEPVQA GEALLTVHAA DQQRAEVALA ALKSAITISA
TSVEALPLVF ESVA