Gene Haur_3133 details

Gene Information

Locus tag: Haur_3133
Symbol:
ID: 5735005
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3960337
End bp: 3961608
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 49%
IMG OID: 641280276
Product: ABC transporter related
Protein accession: YP_001545898
Protein GI: 159899651
COG category: [G] Carbohydrate transport and metabolism
              [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1134] ABC-type polysaccharide/polyol phosphate transport system, ATPase component
TIGRFAM ID:
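
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked against one another. The following is a minimal sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive, that the gene is unspliced (as the record states), and that the coding sequence ends in a single stop codon. The function names are hypothetical and chosen for illustration only.

```python
# Values copied from the Gene Information fields above.
START_BP = 3960337
END_BP = 3961608
GENE_LENGTH_BP = 1272
PROTEIN_LENGTH_AA = 423


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Compare the derived values with the ones stated in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for this record: the coordinate span matches the stated gene length, and the codon count minus the stop codon matches the stated protein length.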


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTATTT TCGAGAACGT TTCCAAGCAT TTTACGATCA AACGTGATCG ACGTAATAGT 
ATTCAGGAAC GGATCGCCGG TTTATTCAAG CCGCAAGTGC TGCCTGACGA GGATTTTTGG
GCGTTACGCG ATGTCAGTTT TACGATCAAT CGTGGCGAAA CGGTTGGCCT AATTGGCCAT
AATGGTTCAG GCAAATCGAC CACACTCAAG CTAATTACCC GTATTTTGCA GCCCAACAGC
GGCAAAGTAA TTGTCGATGG CCGGGTTTCG GCCTTGCTTG AGCTTGGTTC AGGCTTCCAC
CCCGACCTAA CAGGCCGCGA AAATATCTTC TTGAATGGCT CGCTGCTCGG CTTCAATCGG
GCCGAAATGG CCGAAAAAGT GCCAGCGATC ATTCGCTTCT CCGAGATGCA AGAATTTATC
GACATGCCCG TCAAGCACTA TTCATCGGGG ATGTATATGC GCTTGGGCTT TGCCGTGGCG
ATCAACGTCG ATCCCGATAT TTTGATCACC GACGAAGTGC TCGCGGTTGG CGATGAGTTG
TTTCAGCGCA AATGCTTGGA TCGGATTTAC CAACTCAAAC GCCGTGGCAA AACGATTTTG
TTCGTCTCGC ATGCCTTGGG CCAAGTGCGC GACTTATGCG ATCGGGCCTT GTGGTTTCAT
CATGGCAATT TAATGACCGA TAGCACACCC ACCGAAACAA TCGACAATTA TTTAGCCGAA
ACCAACCAAC GTGATGCTGA GCGGATCGAG GCCGAAAAAG CCGCCGAAAA CCCTGAACCA
GAAGCACCAA GTACTGAAAT CGAAACCAAG ACCGAAGAAG AAATCGAATT TGATGCTCGG
CGCTGGGGCA GTCGTGAGGC TGAGATCTAT AATGTAGAAT TGTTGAATGC TGCTGGCGAA
GTGATTCAAA CCGCTACCAC CGGCCAAGCT CTGACCATTC GCATGCACTA TCAAGCCCAC
CAACCAATTG AAAAACCGGT CTTTGGCGTG GCAATCCATC ATCGCACAGG CTTTCACATC
AACGGGCCAA ATTCGCGCTT TGCTGGGCTT GAAATTCCCC AAATCGCCGA ACAAGGCTAT
GTTGATTATC AGATCGAAAA TCTACCGTTG CTTGAAGGCA ACTATGAACT TTCGGTGGCG
TTGTATGACC ATACCTTGAC CCATCCGTAT GATCATCACG ATCGCAAACA CAATTTACGG
GTGTATGCGG CCTCAATTGG TGAACAATTT GGCACAATTT ATATTCCTTC GCAATGGTCT
TGGCATAAAT AA
 
Protein sequence
MIIFENVSKH FTIKRDRRNS IQERIAGLFK PQVLPDEDFW ALRDVSFTIN RGETVGLIGH 
NGSGKSTTLK LITRILQPNS GKVIVDGRVS ALLELGSGFH PDLTGRENIF LNGSLLGFNR
AEMAEKVPAI IRFSEMQEFI DMPVKHYSSG MYMRLGFAVA INVDPDILIT DEVLAVGDEL
FQRKCLDRIY QLKRRGKTIL FVSHALGQVR DLCDRALWFH HGNLMTDSTP TETIDNYLAE
TNQRDAERIE AEKAAENPEP EAPSTEIETK TEEEIEFDAR RWGSREAEIY NVELLNAAGE
VIQTATTGQA LTIRMHYQAH QPIEKPVFGV AIHHRTGFHI NGPNSRFAGL EIPQIAEQGY
VDYQIENLPL LEGNYELSVA LYDHTLTHPY DHHDRKHNLR VYAASIGEQF GTIYIPSQWS
WHK