Gene Haur_2197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2197 
Symbol 
ID5734084 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2788317 
End bp2790902 
Gene Length2586 bp 
Protein Length861 aa 
Translation table11 
GC content50% 
IMG OID641279338 
Productlipid transport protein 
Protein accessionYP_001544965 
Protein GI159898718 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000284103 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCTACC CACGTGTGAA ACGCTGGTTA TCGATGTTAA TCGGGACGGT CTTGCTCGCT 
CAATTGGTGC TAATTCCGCA CCATGCTACG GCTCAAACCC GCTCGTCTGG ACGCGAACCC
GTGGTGGTTG CAACTGAGAG TAAAGGCTAC TATCGCTATC CAGATATGGG CATCTATCGC
TATGCCTGGT CATCAACGGT GGATACGCAA TCAACGACTC AAACGGCGGC TACTCGACCA
CGCCAAGATG TCAATCGCTT GGGTATGACT GGGGTTGTCG AAATTGAGCG CTACGCCAGT
GGCCGCAGCA CCAATCTGTT GCGAGCTAAA CTGGTTAATC CTAAGATGTA TACCATCACC
GAAACAGGCT GGCAGTTGGT TACTGATCGT GAGGTAGTTG GCCCAGACTT CGACAAACAA
GTTGAAGTTC CTTTCTATTT TCAACAACAT ACGACTGGTG AAGTTGTCGG CATGTATTTT
GATCGGACAG ATAGCGAAGA AGTACGAAAT ATGAAACGGG CGGCGGTGTC GCAGTTGGCG
ATGCAGCTTG AATACAGCCC CACACCAAGC TGTGATGATC GCACGATTAC GTGTGCTCAG
CCTTACAAAC GGCTAGAAAC TGATGTAACG GGAACCTACA CAGCCGCCTA TGCTGCGCGG
ATTGTAGATA GCCAGTATGT GGAAATTACG AAAACGCGAG ATCAAGATTC ATATACGGCC
TTTGCTGATC GCACGATTGT TGATGCTCGT GATACCCTGC TCTCCAGTAA ACATACGGCG
ATGTATGATC TACGGGCCGG CGTATTGGTG CGCAATAGCA ATCAACAAAC CATTCGCAAT
CGCTTGGGGA GTGCCGAGCA AGCTCAGTAT GCTGGCTCTG GCTATGGCAT CGAATCGGGA
ATTAATGCCA ATGAGAGCAT AAGCCTTGAT GGTGTTACCA CAGGCGCATT GGATTCGGTA
ATCAGTTTAT CGGGCGATGA ACAGACCAGT GAATTGGCTG CCCTCATGCG GCGCACCGAG
GGGCGCGATT ATGTTGCTAC ACCATTAGTA GCGACCATTG TTGAGCAGGC TGCTGTTGCT
GATCGCGGTT TGGCGGCGAA TGACTTAACC AGTGCTTTGG CTCTTGTAGC CCAAAATCCT
CACGATCCTA ACAATGTGAT CGTGCTGCGG GCGACGCTTA ATACGCTTGA TCGAAGTATG
CAGCAGCTTG ATCAACGCTT GAGCAAAGGG ACAATCGCCA CCAACCTCTA TGAACCTTTA
ATTGGCGCTC TGACTGGAGT GATTGATCCC CAAGCCCAAG CGTTGGTGAT TAAACACTTT
ATTCAATCGC CTAGCGTTGC AGCGTCAATT CGTACCCAAG CCTTGACTGC CTTGACGATC
TTCAAAAAGC CTAGCCTAGA AACGATTCGC TTTGTTACAA CCTTGGCTGA TCAAAATACT
CCTGAGGGAA TGCAAGCATT GCTGGTGCTT GGTGCTATAG CAGGTACGAT TCAACACGAA
CAGCCAGCCC AAGCTCGCGA TCTAGCGACG ATCATTGAAG CAACATTGAC CCATGCCAAG
AGCGATACTG AGCGTGATTT GGCCTTACGC GCTTTGGGCA ATGCCGGAAC CGCGACTGAT
CTTGCAGTTA TTCGTCCATA CCTGGCGGAT GCCAACCAGA TTGTTCGTAC CTCTGCAATT
GATGCCCTGC GCAAATTTCC AGCAGCCGAT ACCAACGCGC TTCTCAACAC CGCGTATCGT
AGCGATACAA GCGAGATTGT CCGTCATACG GCGCGTGAAT TGCTGTATGC CAACGGTGAT
AGCCCAAGTT TGAATGCTTT CGATTGGAAT TGGCAACAAT TTATTGGCGG CGGTGATCTC
AAGGGCGAAT TGAAATCACG AGTATATGTG AGCGATGGCC CCGATATTAC AGCCTTGGCC
CGAGGCGAAG CGAAAGCTCA TGCTTGGTCG TGGAGCTATA CACTGGCCGA AGCCCAAGCC
TCAACCTATG TTCAGACCGA GCATTCAATT AAATATCGCT ACTTTGAAGC GTATGTCAAG
GTCTTGGGTA ACAATGTCTT TACCCCAATC AAAGAACGCC TGCAATGTGG GGTTGAGCGC
ACAGGCAACT TATATCAAAC GACGATCAAT TTCTTCTCGC TTACGAAAAC CTTTATGGTC
GGGCCAGTGC CTGTTCAACT TGGCTTGACC GCCAGTGGAA CGATCTCAAT TCCTTGGAAA
ATCGTGGCAA GTGCCTGTGA TGTGCCGATT TCGGCCAATG CCAATATCTC GATTACGCCA
ACGGTCTGGG CTTCAGCCAG CGCAACGGCT GCCGTTACGA TCTTTGTCGC TCGTGGTGGG
GTTGGAATTA CCGCTGATTT CTTGAAGACC GGCATCGAAG CGAAGGCCAG TGCATCCTAT
CATATTATCA ATGGTTTTCA CGGTAGTATT AATCTGAATG TCTCACTTCA GCCGATGGCC
GTTCGCATCT TTTTGTGGTA TCAATTGCGC AAATTGAATG GAAGCTGGAA ACCACGTAAC
GAGTGGACGC TTTGGAATTG GAGTGCTCCA ACCCAAACTT GGCCGATTTG GAATCATAGT
TTCTAA
 
Protein sequence
MIYPRVKRWL SMLIGTVLLA QLVLIPHHAT AQTRSSGREP VVVATESKGY YRYPDMGIYR 
YAWSSTVDTQ STTQTAATRP RQDVNRLGMT GVVEIERYAS GRSTNLLRAK LVNPKMYTIT
ETGWQLVTDR EVVGPDFDKQ VEVPFYFQQH TTGEVVGMYF DRTDSEEVRN MKRAAVSQLA
MQLEYSPTPS CDDRTITCAQ PYKRLETDVT GTYTAAYAAR IVDSQYVEIT KTRDQDSYTA
FADRTIVDAR DTLLSSKHTA MYDLRAGVLV RNSNQQTIRN RLGSAEQAQY AGSGYGIESG
INANESISLD GVTTGALDSV ISLSGDEQTS ELAALMRRTE GRDYVATPLV ATIVEQAAVA
DRGLAANDLT SALALVAQNP HDPNNVIVLR ATLNTLDRSM QQLDQRLSKG TIATNLYEPL
IGALTGVIDP QAQALVIKHF IQSPSVAASI RTQALTALTI FKKPSLETIR FVTTLADQNT
PEGMQALLVL GAIAGTIQHE QPAQARDLAT IIEATLTHAK SDTERDLALR ALGNAGTATD
LAVIRPYLAD ANQIVRTSAI DALRKFPAAD TNALLNTAYR SDTSEIVRHT ARELLYANGD
SPSLNAFDWN WQQFIGGGDL KGELKSRVYV SDGPDITALA RGEAKAHAWS WSYTLAEAQA
STYVQTEHSI KYRYFEAYVK VLGNNVFTPI KERLQCGVER TGNLYQTTIN FFSLTKTFMV
GPVPVQLGLT ASGTISIPWK IVASACDVPI SANANISITP TVWASASATA AVTIFVARGG
VGITADFLKT GIEAKASASY HIINGFHGSI NLNVSLQPMA VRIFLWYQLR KLNGSWKPRN
EWTLWNWSAP TQTWPIWNHS F