Gene Haur_3350 details


Gene Information

Locus tag: Haur_3350
Symbol:
ID: 5735220
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4225553
End bp: 4227196
Gene Length: 1644 bp
Protein Length: 547 aa
Translation table: 11
GC content: 50%
IMG OID: 641280497
Product: ABC transporter related
Protein accession: YP_001546114
Protein GI: 159899867
COG category: [R] General function prediction only
COG ID: [COG0488] ATPase components of ABC transporters with duplicated ATPase domains
TIGRFAM ID:
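
The start, end, and length values listed above agree with each other if the coordinates are read as 1-based and inclusive, and if the final codon of the CDS is a stop codon. Both points are assumptions rather than something the record states. The following Python sketch checks that arithmetic; the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4225553, 4227196)
print(length)                     # 1644
print(protein_length_aa(length))  # 547
```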


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTATTGC AGATTAACAA CCTCAACAAA GCCTATGGCC CACAGCAAAT CCTCAGCGAT 
ATCGCGTTAA TTATCAATCG TGGCGAACGA ATTGGCCTCG TTGGCCCAAA TGGCGTTGGT
AAATCGACCT TATTGCGCTT AATTATTGGT CAAGAGCAAG CCGATGCTGG CACAATTCGC
TGGGGCGAGG GCTGTGAATA TGGCTATTTA ACTCAGCAAT TAATCACCCC TAGCGAGCTG
AATGTTGAGC AATTACTGGC AGCTAGCCAA CAACAACTCA GCCAACTCGG CCAACAGCTT
GAGCAATTAA GCAACCAAAT GGCCCATGCC GACCCCGATC AACTTGCCGA TCTTTTAGAA
CGTTATGGCG ATGTGGCCGA GCGCTTTGAG TTGCGTGGTG GCTACGAACT GGATTACCGA
ATTGATCAGG TGCTTGCGGG CTTGGGCTTA AGCCATGTGC CCCGAGAACG CTCAGTCCAA
GCGCTATCGG GCGGCGAGAA AACCCGACTT GGTTTGGCCG CGCTGCTAAT TAGCAACCCC
GATGTGTTAT TGCTCGATGA ACCAACCAAT CACCTTGATC ATCAGGCTAG TGCATGGCTT
GAAACGTGGC TCCAAGCCCA TAACGGGGCA ATCTTGGTGG TTTCACACGA TCGAGCGTTT
CTTGATCAAG TGGCAACCAC AATTATTGAG CTTGATGAAC ATACCCATCA ACTCAAAACG
TATCCTGGTA ACTACAGCGC CTATTTTGCC GCCAAGCAAG CTGAACGCGA ACGCTGGGAA
GCTGATTATC AACGGCAGCA GGTCGAAATT CGCCAATTAC AAATGCGAGC CAAAGCCCAA
AATCAACAAG TTGCCCATAA TCGAGCGCCG CGTGATAACG ATGGTTTTAT TTATCATTCC
AAGGGCGAAA ATGTGGCGGC GGCGGTTTCA CGTAATCTGC GTTCAGCCCA ACAAGCGCTT
GAGCGCATTT TGGCAGATCC AATTCCTGAG CCACCCAAAC CACTGGCGAT CAACCCAACC
TTTAATCCCA GCCCCGATGG TAGCCAACAA ATGCTGTCGA TCGAAGGCGT GAGCTATCAG
CGTGAGCAAC AGCCAATCCT CGAACAGATC GATTTGGAAT TACGGCCCCG CCAACGCATC
CTGATTACGG GAGCCAATGG CAGCGGCAAA ACCACCTTGC TAGACCTGAT TGCCGGCGAT
TTGCAACCAA GCACAGGCCA AATTCGCTAT GGCCCAAACC TACAAATTGG CTATTTACGC
CAAGAATATC AGCGCCCCAA GCCTGAGCAA AGCCTATTTG AAGCCTATCG CGAAGGTTTG
CTGGGCTTTA ATAAAGATCT GATCAACGAG CTAGTTTGGT CAGGCTTATT CCGCTATGCT
GAAGTCAATC GGGCGGTTGG CAGCATTAGC ACAGGTCAGT TGCATAAGTT ACAATTAGCC
CGACTGATTG CCGCACGCGC TAATTTGTTG TTGCTTGATG AACCAACCAA CCACTTAAGT
TTTGATGTTT TAGAGCAATT CGAGGCTGCG CTCAATCAGT TTGCTGGGCC AATCATTGCG
GTTTCGCATG ATCGGCGCTT TATTCAGCAA TTTGCTGGCG AGATTTGGCA TTTACAACAG
GGACGGTTAA CGCGCCTATG CTGA
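
The gene sequence is listed in space-separated groups of ten bases, so the spaces and line breaks have to be removed before computing a length or GC content. A minimal Python sketch follows, with hypothetical helper names. The demo input is only the last line of the listing, so it does not reproduce the record's GC figure.

```python
def clean_sequence(raw: str) -> str:
    """Drop spaces and line breaks from a grouped listing and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Last line of the gene sequence listing, used only as a short demo input.
tail = clean_sequence("GGACGGTTAA CGCGCCTATG CTGA")
print(len(tail))  # 24
print(round(gc_fraction(tail), 2))
```

Running `gc_fraction` on the full cleaned sequence gives a value to compare with the GC content reported in the record.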
 
Protein sequence
MLLQINNLNK AYGPQQILSD IALIINRGER IGLVGPNGVG KSTLLRLIIG QEQADAGTIR 
WGEGCEYGYL TQQLITPSEL NVEQLLAASQ QQLSQLGQQL EQLSNQMAHA DPDQLADLLE
RYGDVAERFE LRGGYELDYR IDQVLAGLGL SHVPRERSVQ ALSGGEKTRL GLAALLISNP
DVLLLDEPTN HLDHQASAWL ETWLQAHNGA ILVVSHDRAF LDQVATTIIE LDEHTHQLKT
YPGNYSAYFA AKQAERERWE ADYQRQQVEI RQLQMRAKAQ NQQVAHNRAP RDNDGFIYHS
KGENVAAAVS RNLRSAQQAL ERILADPIPE PPKPLAINPT FNPSPDGSQQ MLSIEGVSYQ
REQQPILEQI DLELRPRQRI LITGANGSGK TTLLDLIAGD LQPSTGQIRY GPNLQIGYLR
QEYQRPKPEQ SLFEAYREGL LGFNKDLINE LVWSGLFRYA EVNRAVGSIS TGQLHKLQLA
RLIAARANLL LLDEPTNHLS FDVLEQFEAA LNQFAGPIIA VSHDRRFIQQ FAGEIWHLQQ
GRLTRLC
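
The start of the protein sequence can be checked against the start of the gene sequence by translating codon by codon. The Python sketch below builds the codon table from the compact 64-letter form of the standard genetic code. For internal codons, translation table 11 assigns the same amino acids as the standard code and differs only in the start codons it allows. `translate` is a hypothetical helper, not part of any library.

```python
from itertools import product

# Codons enumerated in T, C, A, G order; the amino-acid string follows the
# same order (standard code, shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed in this record.
print(translate("ATGTTATTGC AGATTAACAA CCTCAACAAA"))  # MLLQINNLNK
```

The sketch translates every codon literally. That is enough here because the gene starts with ATG, but a gene with an alternative start codon such as GTG or TTG would need its first residue set to M separately.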