Gene Haur_1579 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1579 
Symbol 
ID5733466 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1835745 
End bp1836773 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content47% 
IMG OID641278718 
ProductNMT1/THI5-like domain-containing protein 
Protein accessionYP_001544350 
Protein GI159898103 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00570901 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTTATC GTCGTATCTT AGCCGTGGTT GGGTTATTCT TTTTGGCGGC TTGTGGTGGG 
CAAACTGCTA CGCCAACCGC CGTTACGGGC AATGATCAAG CGAAACCGCT AACCAAAGTA
ACGATTGCCA TGCCCTATGT GCCGAATATT CAATTTGCCC CGTTTTATTT GGCCAAAACC
CAAGGCTACT ACGAAGCTGA AGGCTTAGAT GTAACCTTCG ATTATCAATA TGAAACTGAT
TCGGTGCAGC GTGTAGCTAA TGGTTCTGTT CAATTTGGCA TGGCTGGCGG CGATTCGGTG
CTGCTAGCAC GAGCGCAAGG CTTGCCTATT ATGACTGTTG CAACGATCAG TCAACGCTCG
CCGATTGTTT TTTATAGCAA AGCTGAGCTG AATATCAAAA CTCCAGCCGA TCTCAAAGGC
AAAAGTGTTG GGATTCCAGG CCGCTTTGGG GCTTCGTATA TTGGTTTGTT GGCGTTGATG
TATTCAAATT CCTTGCAAGA GAGCGATTTG AACATTCAAG AAATTGGCTT TGCCCAAGTT
CAAGCGCTGA GCGAAGATAA AGTGCAGGTT GCCAGTGGCT ATGGCAATAA CGAGCCAATT
CAGTTGGCCG AGGCTGGGGT TAAATTAAAT GTTATTCGGG TGTCGGATTC GTTTGCCTTG
ACCTCTGATG GCCTTATTGT CAGTGAAAGC TTGATTAAAG AGCAACCCAC GGTGGTTATG
GGCTTTGTCA AAGCCACATT AAAAGGCATG AGCGCTACGA TTGCTGATCC GACGCAGGCC
TTTAATAGTA GTTTGCGTGA AATTCCCGAG CTGCAAGCGG CTGATGATGC GACCAAAGCC
TTGCAACAAA AAGTTTTAGC TGAAACAATT GGCTATTGGC AAAGCGATTC GACTGCTAAA
TATGGCCTTG GGTTTACTGA TCAGGCCACT TGGCAAGCGA CTCACGATTT CTTGCGCCAA
CAAAATATTC TCAAACAAGA TGTTGCAGTG GGCGAGTCGT TTGTGAATGG GTTTATTGCT
ACACCCTAA
 
Protein sequence
MRYRRILAVV GLFFLAACGG QTATPTAVTG NDQAKPLTKV TIAMPYVPNI QFAPFYLAKT 
QGYYEAEGLD VTFDYQYETD SVQRVANGSV QFGMAGGDSV LLARAQGLPI MTVATISQRS
PIVFYSKAEL NIKTPADLKG KSVGIPGRFG ASYIGLLALM YSNSLQESDL NIQEIGFAQV
QALSEDKVQV ASGYGNNEPI QLAEAGVKLN VIRVSDSFAL TSDGLIVSES LIKEQPTVVM
GFVKATLKGM SATIADPTQA FNSSLREIPE LQAADDATKA LQQKVLAETI GYWQSDSTAK
YGLGFTDQAT WQATHDFLRQ QNILKQDVAV GESFVNGFIA TP