Gene Haur_4527 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4527 
Symbol 
ID5736378 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5792672 
End bp5793931 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content50% 
IMG OID641281689 
Productarsenical pump membrane protein 
Protein accessionYP_001547286 
Protein GI159901039 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1055] Na+/H+ antiporter NhaD and related arsenite permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.142525 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACTGC TACTTGGTGG AACAATTTTT GGAGCCACGC TCTTAGGCGT GATGGTTCGT 
CCGCGCAATA TTTCGGAAGC TTGGGTCGCG TTACTTGGCG CAGTTGCCAT GTTATTGGTC
GGAATTTTGC CACTGAAGGC AATTTTGCCG ACGCTTGCCC GCGAATGGAA TGTGTATGGT
TTTTTTGCCG GTTTGATGCT GATTGCCTTT TTTGCCGAGC AAGCTGGGGT GTTTCAGGCC
TTGGCGTTGC AGGCAGCACG TTGGGCCAAT GGCTCGGCGC AGCGGCTCTA TTTGGCGGTC
TTTTTGGTGG GCACGCTGAT TACAGCGGTG CTCTCCAACG ATGCGACAGC CTTGATTTTA
ACTCCAGTGG TTTGGACGTT GGCCAGCCGT TTGCGCTTGC CAGCCTTGCC ATTTATGTTT
GCCTGCACCT TTATTGCCGA TACTGCTTCG GCATTGCTGC CCGTTTCCAA TCCGATTAAT
ATTTTGGTGC TAACCCGCTT TAATCGTGAG CTGTTGGAAT ATTGGGCCTA TTTGTTGGTT
CCGTCACTGG TGTGTATTGG CTGGAATATT GGGCTATTTG CTTGGCTGTT TCGGCGCGAT
TTGCAGGGCA GCTACGATTT GGCACTGCTT GATGATTTAA CTATCGCCAA CCCACGCCTT
TATCGGACAA CCCTTGTTGG CTTGGGCAGT ATTGCTGTGG CCTATGTAGC TGGCTCGTTG
TGGCACGTGC CATTGGCCTT TGTGGCGTTG GCTGGAGCCG CTCTCTTAGC GGCAATTAGC
TGGTGGAATG GCACGTTTAA GCCCAAACAA GCCTTGCACG AATTATCTCC AGCCTTGTTT
GGCTTTATTA GCGGCATGTT TTTGGTGGTA CGGGCGATTG AGCAATTAGG CTGGACTGAA
CGCTTTGGCG CGAGTTTGTT ACAGGGCAGC GGCGCGAGTT TGGGCAATAT TGCGCGGGTA
ATTTTCGGTA GTGCACTTGG CTCGAACATG ATCAACAATG TGCCGATGAC CTTGGTGATG
ACCTCGACGC TTGAACATTT GCCTAGCACA CCTGAGCCTG CGTTGATTTA TGCCACCATT
TTTGGGGCTG ACCTTGGGCC AAATTTAACA ATTGTTGGCT CGTTGGCTAC AATGCTGTGG
TTGGTGATTT TACGGCGCAA AGGTCTGGAA ATTAGTGCCA AACAATACTT TAAATTGGGC
TTGCTGTTTG TGCTACCATC CTTATTAATT GGTACATTTT GGATGTGGCT GATGGCATGA
 
Protein sequence
MQLLLGGTIF GATLLGVMVR PRNISEAWVA LLGAVAMLLV GILPLKAILP TLAREWNVYG 
FFAGLMLIAF FAEQAGVFQA LALQAARWAN GSAQRLYLAV FLVGTLITAV LSNDATALIL
TPVVWTLASR LRLPALPFMF ACTFIADTAS ALLPVSNPIN ILVLTRFNRE LLEYWAYLLV
PSLVCIGWNI GLFAWLFRRD LQGSYDLALL DDLTIANPRL YRTTLVGLGS IAVAYVAGSL
WHVPLAFVAL AGAALLAAIS WWNGTFKPKQ ALHELSPALF GFISGMFLVV RAIEQLGWTE
RFGASLLQGS GASLGNIARV IFGSALGSNM INNVPMTLVM TSTLEHLPST PEPALIYATI
FGADLGPNLT IVGSLATMLW LVILRRKGLE ISAKQYFKLG LLFVLPSLLI GTFWMWLMA