Gene Haur_4027 details

Gene Information

Locus tag: Haur_4027
Symbol:
ID: 5735888
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5141869
End bp: 5142984
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 52%
IMG OID: 641281177
Product: ABC-3 protein
Protein accession: YP_001546787
Protein GI: 159900540
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1108] ABC-type Mn2+/Zn2+ transport systems, permease components
TIGRFAM ID:
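
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database tooling.

```python
# Coordinates copied from the record above.
START_BP = 5141869
END_BP = 5142984


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))  # prints: 1116 371
```

Under those assumptions the result agrees with the listed gene length of 1116 bp and protein length of 371 aa.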


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAGCA CGTTTGCAAT TATTCTGACT GGAATTTTGG TGGCGGCTGC CTGTGCCTCG 
CTTGGCTGTT TTCTCATCTT GCGGCGCATG GCGATGCTGG CCGACGCGAT TAGCCATGCG
ATTTTGCCAG GCTTGGTAGC GGGCTATTTC ATTGCGCAAG GCCCAAGTTT ATTGGCAGGC
TTTTTTGGGG CAGCCTTGGC AGGCCTGATT ACCGTAGGCT TGGTTGAGTT GCTCAAAAAT
ACCCAGCGCG TTGGCGGCGA ATCGGCAATT GGGATTGTGT TTCCAGCTAT GTTTGCCTTG
GGAACCTTCC TCGTCTCGAA ATATTATGCC GATGTTCACC TTGATGCCGA TGCGGTGCTC
TATGGCAACA TCGAGTTTGT CGGCTTCGAC CATTGGTATC TTGGCGAAAC CGATTTAGGG
CCGCAATCGC TGTGGATCAT GGGCTGCTTA TGTTTGATCA ATTTGATTTT TATTAGTGTG
TTTTACAAAG AACTCAAATT GGCTACTTTT GATGCTGGCT TAGCAGCAAC CCTTGGCTTT
TCGCCAATTC TCATTCACTA TGGCCTGATG GCGGTGGTTT CAATTACCAC GGTTGGTGCA
TTTACGGCGG TTGGCGCGAT TTTGGTGGTG GCCTTGATGA TTGTGCCTGC GGCCACGGCC
TATTTGCTGA CCGACCGCCT GCCGTGGATG ATTGGCTTGG CGATTGCGGC TGGAGCGATT
GCCGCAGTTG CTGGCTTTGG CTTAGCCTTT ATGCTGAATG GCTCGGTTGC AGGCGCAATT
GCCACAGTTA CCGGAGTTGA ATTTGGCTTG GCCTTGTTGT TCAGCTCCAA ACAAGGCTTG
GTTGCCCGGG CACAGCGCTT GGCTCGCCAA CGAATTCAAT TTGCTGCCGA TACGCTCTTG
GTGCACCTGT TGCAACACGA AGGTAGCCCC GACGCTGCCC ATGAAAGCAC GATTGACCAT
TTGCAAAGCG AATTGCAATG GAATAACAAG TTTGCGCAAC GAGTGCTTGA TCTGGCTCAG
CGGCGAAGTT TGGTGCGCTG CCAAGGCCAA CAACTTGAAC TGACGACCAG TGGGCGTGAA
CGGAGTCAGC AAGTCTTGCA ATTAGGTCGT GGCTAG
 
Protein sequence
MSSTFAIILT GILVAAACAS LGCFLILRRM AMLADAISHA ILPGLVAGYF IAQGPSLLAG 
FFGAALAGLI TVGLVELLKN TQRVGGESAI GIVFPAMFAL GTFLVSKYYA DVHLDADAVL
YGNIEFVGFD HWYLGETDLG PQSLWIMGCL CLINLIFISV FYKELKLATF DAGLAATLGF
SPILIHYGLM AVVSITTVGA FTAVGAILVV ALMIVPAATA YLLTDRLPWM IGLAIAAGAI
AAVAGFGLAF MLNGSVAGAI ATVTGVEFGL ALLFSSKQGL VARAQRLARQ RIQFAADTLL
VHLLQHEGSP DAAHESTIDH LQSELQWNNK FAQRVLDLAQ RRSLVRCQGQ QLELTTSGRE
RSQQVLQLGR G
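
The gene and protein sequences above are related by codon translation, and the GC content field is a simple base count. The following is a minimal sketch of both, not the tool that produced this record. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs only in which codons may initiate translation), and it ignores the spaces and line breaks used to group bases in blocks of ten. The names `translate` and `gc_content` are hypothetical helpers.

```python
# Build the standard codon table in NCBI base order (T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(seq: str) -> str:
    """Remove whitespace between base blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    seq = clean(dna)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First two blocks and the last line of the gene sequence above.
    print(translate("ATGAGTAGCA CGTTTGCAAT"))                    # MSSTFA
    print(translate("CGGAGTCAGC AAGTCTTGCA ATTAGGTCGT GGCTAG"))  # RSQQVLQLGRG
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison against the protein sequence and the GC content listed in this record.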