Gene Haur_2932 details


Gene Information

Locus tag: Haur_2932
Symbol:
ID: 5734804
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3707374
End bp: 3708408
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 48%
IMG OID: 641280076
Product: NMT1/THI5-like domain-containing protein
Protein accession: YP_001545698
Protein GI: 159899451
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID:
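
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 3707374
end_bp = 3708408

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: unspliced CDS with one terminal stop codon that is not translated.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1035 344
```

Both results agree with the Gene Length and Protein Length fields of the record.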


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.01782
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTAAGC TTCTGTCTCG CCGAGGAGTT AGGATGCGCC GGTTAAGCTA TCTCAGTGTG 
CTCATGCTGG TGTTACTGGC TGCTTGTAGT ACGAGCCAAG CAACCCCCAC CCCAGCCCCT
AAGGATTCGG TCAAGCTTCA GTTGAACTGG GTTTTTGATT ATTCGTCGTC GGGCTTTTTT
GCTGCTGAAA AGAATGGTCG TTTTGGCGAG CAGAATTTGA ATGTCGAGTT GATTGCAGGC
GGTTTTGATG CTAACGGCTA TATTGATGGT ACTGAAAAAG TCAGTAGTGG GGCCGCTGAT
TTTGGGGTAG CCAGCGCCGA TAGTGTGATT CAAGCTCGTG CTAATGGCAA ACCTGTGGTT
GGGATTGCCG TGCTAACCCA AAATAGCCCA TTAGCGATTC TTTCCTTGCC TGGCGCTAAT
ATTCGCACGC CCCAAGATTT GGTTGGCAAG AAAGTGCTGG TCTCGGAAGG CGGGGCAACC
CAGCTTTACA ATACCTTGCT GACCGCCCAA GGCATCGATT TGGAGAGCGC CAAGCCCTTA
CCACGCTTCG ATTCAGGTAT CGATCAGTTA ATTGATGGCG AAATTGATGC GTTGGTGGCT
TGGAATATCA ACGAAGCAAT TGAATTAAGC GAGCGGGGCT ATCCACCCTC AATTATGTTG
ATGAGCGATT ACGGCATCAA TAGCTATGAG TTGGTGATTA TTACTACCGA AAAAATGGCA
ACTGAAAATC CCGATTTGGT CACCCGTTTC CTCAAAGCTA CCTTCAAAGG TTGGAATGAC
GTAATTGCTA ATCCAAGCCA AGCGGTTGAT TATGTTGTGA CCTACGATGT TAAGCTCAAT
CGCGATGCCC AACTACGGCG CTTAACTGAA ATGCTGAAGT TGATCAAGCC TGCGAACACC
AAAATTGGCG ATATGCGACC CGATCTTTGG TCGTTTACCC ACCAAATGTT GCAAACCCAG
GGCGCACTCA AAGAGCCAAT TGAGTTGGGT CGGGCCTATT CAACCTTGTT CTTAGATGTT
ATTCCTGACC GCTAG
 
Protein sequence
MVKLLSRRGV RMRRLSYLSV LMLVLLAACS TSQATPTPAP KDSVKLQLNW VFDYSSSGFF 
AAEKNGRFGE QNLNVELIAG GFDANGYIDG TEKVSSGAAD FGVASADSVI QARANGKPVV
GIAVLTQNSP LAILSLPGAN IRTPQDLVGK KVLVSEGGAT QLYNTLLTAQ GIDLESAKPL
PRFDSGIDQL IDGEIDALVA WNINEAIELS ERGYPPSIML MSDYGINSYE LVIITTEKMA
TENPDLVTRF LKATFKGWND VIANPSQAVD YVVTYDVKLN RDAQLRRLTE MLKLIKPANT
KIGDMRPDLW SFTHQMLQTQ GALKEPIELG RAYSTLFLDV IPDR
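
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the tool that produced the record: the helper `translate` and the codon-table construction are illustrative, and it assumes the sense codons of translation table 11 match the standard genetic code (the tables differ in which start codons are permitted, which does not matter here because the gene begins with ATG). Only the first and last rows of the gene sequence are used as input.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence listed above.
first_row = "ATGGTTAAGC TTCTGTCTCG CCGAGGAGTT AGGATGCGCC GGTTAAGCTA TCTCAGTGTG"
last_row = "ATTCCTGACC GCTAG"

print(translate(first_row))  # MVKLLSRRGVRMRRLSYLSV
print(translate(last_row))   # IPDR
```

The two outputs match the first 20 and last 4 residues of the protein sequence. The last row can be translated on its own because the 17 preceding rows hold 1020 bases, a multiple of three, so it starts in frame.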