Gene Haur_1848 details


Gene Information

Locus tag: Haur_1848
Symbol:
ID: 5733737
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2148575
End bp: 2150212
Gene Length: 1638 bp
Protein Length: 545 aa
Translation table: 11
GC content: 50%
IMG OID: 641278992
Product: YidE/YbjL duplication
Protein accession: YP_001544619
Protein GI: 159898372
COG category: [R] General function prediction only
COG ID: [COG2985] Predicted permease
TIGRFAM ID: [TIGR01625] AspT/YidE/YbjL antiporter duplication domain
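
The start and end coordinates, the gene length, and the protein length listed above are mutually consistent, and the relationship can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2148575, 2150212)
print(length, protein_length(length))  # prints: 1638 545
```

Both values match the Gene Length and Protein Length fields of this record.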


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTGGACT TGCTAGCGTC GAATCCACTA TTGCTGCTGT TTGTAGTTGC AGCAATCGGC 
TATCCTTTAG GGCAATTGAA TCTCGGCGGT GTACGCCTCG GCGTTGCCGC CGTTTTGTTT
GTGGGCTTGG CAATTGGCTC GCTCGACGAG CGCCTAAAAC TGCCCGAAGT GCTGTATCAA
TTTGGCTTGG TGATGTTTGT TTACACCGTC GGGTTGAGCA GCGGCCATCA GTTTTTTCGC
TCGTGGAAGA GCAAAGGCTT ACGCGATAAT CTCTTTATTG GTGGGATGAT CGTCGTTGCC
TATCTGTTTG CAATTCTGGC CCATTCTTTC TTTAGCATCA AACCAACCTT GACCGTTGGC
CTGTTTACCG GCACAATCAC TAACACGCCA GCGCTGGCTG CGGCCATCGA ATATTTGAAA
TCGGTCTTGC CCGAGCAAGA ATTAGCCGCT GTGGTCAATG ATCCAGTTGT CGGTTATTCG
ATTGCCTACC CCATGGGTGT GGTGGCGATG ATGTTGGCAA TTGTTTTTGT CCAACGGTTG
TGGCGCATCG ATTATGTCAA AGAATTAGCC TCACGCCATG ATTTAGCTGG TAATCATCAC
GATATTACCA GCCGCACGGT TGAAATTACC AATCCTGAAG TCACTAATCT ATCAGTTCAG
GCCTTGATTG CCAAATATCA TTGGCCCATG GTTTTTGGGC GCTATCGCCG CGATGGCCAT
GTAGCGGTCA CCACTGGCGC AACCCAATTT ATGCTCGGCG ATCATGTGAG CATTGTTGGG
GCCAGCGAGG TGCTCGAAAA AGCCCTAGCC GTGCTTGGCA AAGAAACCGA CGATATTTCC
AACGACCGCC GCGATCTTGA TTATCGCCGC ATGTTCGTCT CTAACCCCAA AATTGTTGGA
ATTCGGCTGG AGCAGCTCAA TTTGCCCCAA GTCTTGGGGG CAACCATTAC CCGGGTGCGT
CGTGGCGATG TGGAAATGTT GCCAACCGCC GATACCCGCT TGGAGCTTGG CGATCGGGTG
CGGGTGATTG CGCGGCGTGC TGATATTCCT CAAGTGAGCC GCTTTTTTGG CGATTCGTAT
CGCGCCTTGA GCGAAGTTGA TATTTTGACC TTCACTTTGG GCTTAGTCTT AGGCTTGGTT
GTGGGTTCGT TGGTGTTGCC ATTACCAGGT GGCGTGAGCA TCAAACTTGG CTTTGCTGGT
GGTCCATTGG TAGTTTCGTT GATTTTGGGC GCACTCGACC GCACTGGCCC ATTGGTTTGG
AACATGCCCT ATAGCACCAA CTTAACCCTG CGCCAAATTG GGATTGTGAT GTTCTTGGCG
GGGGTTGGCA CCCGCGCAGG CTACGATTTC GTCAAATTCT TATTCAGCAG TAATGGCTTG
CTCTTGTTTG GAGTTGGCGC AGCAATTACC TTTAGCCTCG CCATGTTGAT TTTGACGATT
GGCTACAAAG TGCTGAAAAT CCCAATGGGC ATTTTGACGG GGATGTTGGC GGGCTTTCAA
ACCCAGCCAG CCTTGCTCAG TTTTGCGCTG CAACAAAGCA ACAATGAGCT GCCAAATCAG
GGCTATGCTG CGGTTTATCC ATTAGCATTA ATTATCAAAA TCATTCTGGC CCAATTATTA
CTGATTCAAC TTCTATGA
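
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any computation such as the GC content reported in the record. The following is a minimal sketch. The record does not state how its 50% figure was derived, so the plain (G + C) / length definition used here is an assumption, and the helper names are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments drops spaces and line breaks alike.
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, assuming (G + C) / total length."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used here only as sample input.
first_line = "ATGTTGGACT TGCTAGCGTC GAATCCACTA TTGCTGCTGT TTGTAGTTGC AGCAATCGGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full block of sequence lines to `clean_sequence` in the same way gives a string whose length can be compared with the Gene Length field.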
 
Protein sequence
MLDLLASNPL LLLFVVAAIG YPLGQLNLGG VRLGVAAVLF VGLAIGSLDE RLKLPEVLYQ 
FGLVMFVYTV GLSSGHQFFR SWKSKGLRDN LFIGGMIVVA YLFAILAHSF FSIKPTLTVG
LFTGTITNTP ALAAAIEYLK SVLPEQELAA VVNDPVVGYS IAYPMGVVAM MLAIVFVQRL
WRIDYVKELA SRHDLAGNHH DITSRTVEIT NPEVTNLSVQ ALIAKYHWPM VFGRYRRDGH
VAVTTGATQF MLGDHVSIVG ASEVLEKALA VLGKETDDIS NDRRDLDYRR MFVSNPKIVG
IRLEQLNLPQ VLGATITRVR RGDVEMLPTA DTRLELGDRV RVIARRADIP QVSRFFGDSY
RALSEVDILT FTLGLVLGLV VGSLVLPLPG GVSIKLGFAG GPLVVSLILG ALDRTGPLVW
NMPYSTNLTL RQIGIVMFLA GVGTRAGYDF VKFLFSSNGL LLFGVGAAIT FSLAMLILTI
GYKVLKIPMG ILTGMLAGFQ TQPALLSFAL QQSNNELPNQ GYAAVYPLAL IIKIILAQLL
LIQLL