Gene Haur_0428 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0428 
Symbol 
ID5732327 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp499212 
End bp500303 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content52% 
IMG OID641277554 
Producthypothetical protein 
Protein accessionYP_001543207 
Protein GI159896960 
COG category[C] Energy production and conversion
[P] Inorganic ion transport and metabolism 
COG ID[COG1668] ABC-type Na+ efflux pump, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAATC GTGCAGTCAG CCGTCGTTGG AGCTTACAAC CCAACCCAAT CGTGGTCAAA 
GAGTTACGTT CGCGCATGCG CGGTGGCCGA GCGTTTATCT TGCTCACCGG CTTTTTGCTT
GGCTTAGGTT TGGTTGGCTA TGCTTTGTTT TTGCGAGCAA CCAGCGGCCA AGCAGCGGGC
ATGCCGATCT TTAGCGCCCA AATTGGCCAA ACCTTGTTTG CAGGTTTAGC CTTTATGTTG
CTGTTTTTGG TGATTTTAAT TGCCCCAGCA GTAACAGTTG GCGCTATCAG CAGCGAACGC
GAACGGCTAA CCTATGAAAT GTTGATCGCC ACGAATCTGC CATCACATCG CTTGCTGTGG
GGCAAATTGA TCTCAGCGCT TTCGTATATT GCGCTCTTGT TGCTGGCGGC GATTCCACTG
GGCAGTATCG TCTTTTTATT TGGCGGGATT ACGCTGCGTC ATGTGCTGCA AGCAGCCGCA
ATTATTGTGT TAAGCGCGAT CACCAGCGCC ATGCTTGGCA TTTGGGCTTC GGCCATGACT
GGTAAAACCG GACGGGCAGC AGTAGTCAGT TATGTGATTA TTGGCTTGGT GTTAGGTGGA
TTTTTGGCTG GAGCTGAAAT CTGGACTAAC CGCTCGACTG AACCTGTGCC CACTGGCCTC
TTAGCCCCCA ACCCGATCAG CGCCTTGGCT TCAAGCATTG GCACTTTGTC GGGGCCAAGC
AATGGCGGGA TGATTACCAC AATGCGAGCG TTTGAAGCAG TGCCAATGGC GGCTGGCGGC
GGTTGGTGGC CTGGAATGAA TTCGTTTAGT GGGGGCTTGA ATTTCCCAGG CTGGCAAACC
TTTTCAATTG GAATTTACCC AGCCACCGAT CAATTTGGCC CTGATGGCAT GCCAATTATT
GCCGAGCCAC GTTCGGTTTG GCGCACGACC TTGATTATTT TGATTATGAC ATGCTTGGTG
TTCTTCTGGA TGGCGCTGCA TGCGGTCAAA CCACGTCGTC GCTGGTGGCC AACTTGGGGC
GATTTGGCGA TGCTGTCGGT CACCTCGTTG AGTTTGCTTG GCCTTGGCTA TTGGATTGGC
TGGTGGTTTT AG
 
Protein sequence
MSNRAVSRRW SLQPNPIVVK ELRSRMRGGR AFILLTGFLL GLGLVGYALF LRATSGQAAG 
MPIFSAQIGQ TLFAGLAFML LFLVILIAPA VTVGAISSER ERLTYEMLIA TNLPSHRLLW
GKLISALSYI ALLLLAAIPL GSIVFLFGGI TLRHVLQAAA IIVLSAITSA MLGIWASAMT
GKTGRAAVVS YVIIGLVLGG FLAGAEIWTN RSTEPVPTGL LAPNPISALA SSIGTLSGPS
NGGMITTMRA FEAVPMAAGG GWWPGMNSFS GGLNFPGWQT FSIGIYPATD QFGPDGMPII
AEPRSVWRTT LIILIMTCLV FFWMALHAVK PRRRWWPTWG DLAMLSVTSL SLLGLGYWIG
WWF