Gene Haur_1990 details

Gene Information

Locus tag: Haur_1990
Symbol:
ID: 5733879
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2450884
End bp: 2453481
Gene Length: 2598 bp
Protein Length: 865 aa
Translation table: 11
GC content: 51%
IMG OID: 641279134
Product: tail collar domain-containing protein
Protein accession: YP_001544761
Protein GI: 159898514
COG category:
COG ID:
TIGRFAM ID:
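
The listed lengths can be cross-checked against the coordinates. The sketch below is only a consistency check, not part of the record. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
start_bp = 2450884
end_bp = 2453481


def gene_length(start: int, end: int) -> int:
    """Length in bp of an inclusive, 1-based coordinate interval."""
    return end - start + 1


def protein_length(n_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2598 865
```

Both values agree with the Gene Length and Protein Length fields.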


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGACAAATC CATTTGCGCA AGCATTAACT CTTTATTTAC CACTTGATAG CATGGGCATC 
AACAACACCA GCGTCAGCGA TCTTTCGGGC AACCGTAATC ATGGCACAAT TCACGGCAAT
GTGATGGTTG TGCCTGATGA TCAGGTTGGC AGTTGCGCCT GTTTTGATGG TCAAAGTTGG
GTCGAGTTAG CCAATCCATT TGCCAGTGCC AGCGATTTTA CGCTGGCGTT GTGGGTGCGG
CCAACCCGCT TCGATGGAGC CTACCATGGT TTTATTGGCA AGCAAGCCGC CGAAGATTTG
TATCGCAAAC CAAGTATGTG GGTGATGGGT GATGGTGGTT TGCATCTTGA TAGTTACTCG
TCTGATGGAA CTCGCTTTCA TTATGAATTA GCAGGCTTTT TTGCTCAGCC AAATGAGTGG
GTGCATGTGG CCTGGGTTAA ATCGGGCACG GCCTATACAA TCTATCGCAA TGGGGTGGCC
TTTACCGAAC GACCAGCCCC TGCCGAAGTC TATGTGCCAG CCAGTAGCTA TTGGCTTGGC
AAAGTCGATA ATTTGTTTGA TGGATGCTTG GCGCATGTGC GAATGTACAA TCAGGCGCTT
GATCCTGCTG CGGTGGCCGA TATTGCCGCC CATGATCGGG TCGCTCGCAT GGTGTTTCGG
GCAAGTTATC CGCTTGATTT CAATTTGCTG AATTCGCAGC AAGAGCCAAA TCTCAACCCT
GGCACAAACC CATTGACCTT GACCCTGACC AATGCCAGCG CCCAAAGCAT CGAATTATCG
CCGCTTGATC GCAGTGCTCC AGCGCAACAG CAACACTTCA GTTTTAGTTT TCGCCCCAAT
TTATTGGCAA TTAACACTGG AATTGCAATT GATCATCCGG CTTGGCAGGT TAGCACTCAG
TCAATGAGCG ATGGCCGCAT GAATATCCTT GTTCGCTCGA CCGAGGCTCA AACGTTAACT
CCCAATCAAA CCTTGCGCTT TGCACTGAGT GGGATTACGG TTTTGCCTCA AGATGGCAGC
CACTCAACCC AAATTGAAAT GCAATATAGT AATTTACGCT ATGTTGGTGA AATAAGTCTG
TTAAACGGAA GTCGTCTGCA ACGCATTAGT ATTAGTACCG ATGATAGTGC CTTGGATTTG
CCGTTGCACC TGAGTTTGAG CAACGGTGCA ACAATTCTCA ACACCAACCA GCCCAACCAT
TTAGTTGCCC GAATAAGTAA CACCTCAACC CATAGCACAT TGCACTTTAA TCAATCCGAG
CCACAGAGCC ACTTAATTGT GCGGTTTGAT GGCAGTGCCA GTGCTGAGCC ATGGGCCTTG
GCAACGCCCG ACCAAATTAA TGCGATTACG ATTGAGGTAG CGGGCTGGGA TGTGCAACGT
CAACAGCAAG CTGGTCAAAC GACGTGGGTT TGTCGTCCAC TCAGCGATGT GGCTTTGGCT
CCAGGCGCAG CGCTTGACTT ACAGATCAAC AATATCGTGA CGACCCACCC GCAAGGCAAT
ACGACCCTGT ATGTTGTGGT GCACGAGCTG CAAGGTTTCA ACGATACCAC CTTAACGACG
ACAATTACCA AAACCTCGAT GAACACTATT AGTAACGCTG GGCAAACTCA AAATACGCTT
GCACTTGGCG AAAATGGCTT TATCAGCGGA GCGGGCTACA ACACCTTGGT AGCTCAAACG
ACACTGAGCG GTGGTGGTCG AATTAGCTGG CGCAACCGTA AAGTTCGCTG GACGCAGCGC
TTTTTGGCGA TTAGTATGGG GCATACTGGC TTTCCGGTTG GACACTTTAA TATTGCATAT
CCAACTGCAC CAATTCCTGC TGCCGATTGC TACGATAACC TCGAACGACC AGTGAGTGAT
GGCATCGAAT TGCGTGATTG GGAAGCGTTA TATGCGATTT ATACTCCCAG CACCAGCCCC
AGTACCACCA GTTTACGAAT TGTGCATTAT GCCAAGCCTT TCAATCTTGA AGGCCGGGCA
GTGCTGGTGG CCGTTTTTAA TGCTGATGAT CGCACCTTGA AGCTTGGCTC TGGCCTGACG
TTGAGCCATC AAGGAACCTA TTCCAATGGC AGCCCAATTC CATGTGGCAC CATTCAAATG
TGGTCGGGCA TGGAAGTGCC TGAAGGCTGG GCGATTTGTG ATGGCCGCGA AGCCAACGGC
TTGCGCACCC CTGATTTGCG CAACCGCTTT ATTGTTGGAG CTGGGGCCAA TTACGATAGT
GGCAACCTCA GTGTTTATGG TACGAATCAA GGTACAACTG GCGGCAGCGA TGTAGTGGCA
TTAACCCTCG ATCAAATGCC GCGCCACACC CATGGCGGTT CAACCAATGC CGCAGGCGAC
CATAGCCATT GGGTTGAAGG CACTGATGCC GATGGCTTAG CCAAACGTCG CCGTCACCAT
TGGGGCGATA CTACCGTCGA TATGGGTTTT GGTGGTGGCC GCAACGCCGA CCCTAACGAT
GAACGCTGGC GTGGCCGGGT CAATACCGAT AATGCTGGTA CCCATAGCCA CGGCCTGATG
ATTGGTGAGG TTGGTGGTAG CCAAGCCCAC GAAAATCGCC CGCCATTCTA TGCGCTCGCC
TTCATTATGA AAGTTTAA
 
Protein sequence
MTNPFAQALT LYLPLDSMGI NNTSVSDLSG NRNHGTIHGN VMVVPDDQVG SCACFDGQSW 
VELANPFASA SDFTLALWVR PTRFDGAYHG FIGKQAAEDL YRKPSMWVMG DGGLHLDSYS
SDGTRFHYEL AGFFAQPNEW VHVAWVKSGT AYTIYRNGVA FTERPAPAEV YVPASSYWLG
KVDNLFDGCL AHVRMYNQAL DPAAVADIAA HDRVARMVFR ASYPLDFNLL NSQQEPNLNP
GTNPLTLTLT NASAQSIELS PLDRSAPAQQ QHFSFSFRPN LLAINTGIAI DHPAWQVSTQ
SMSDGRMNIL VRSTEAQTLT PNQTLRFALS GITVLPQDGS HSTQIEMQYS NLRYVGEISL
LNGSRLQRIS ISTDDSALDL PLHLSLSNGA TILNTNQPNH LVARISNTST HSTLHFNQSE
PQSHLIVRFD GSASAEPWAL ATPDQINAIT IEVAGWDVQR QQQAGQTTWV CRPLSDVALA
PGAALDLQIN NIVTTHPQGN TTLYVVVHEL QGFNDTTLTT TITKTSMNTI SNAGQTQNTL
ALGENGFISG AGYNTLVAQT TLSGGGRISW RNRKVRWTQR FLAISMGHTG FPVGHFNIAY
PTAPIPAADC YDNLERPVSD GIELRDWEAL YAIYTPSTSP STTSLRIVHY AKPFNLEGRA
VLVAVFNADD RTLKLGSGLT LSHQGTYSNG SPIPCGTIQM WSGMEVPEGW AICDGREANG
LRTPDLRNRF IVGAGANYDS GNLSVYGTNQ GTTGGSDVVA LTLDQMPRHT HGGSTNAAGD
HSHWVEGTDA DGLAKRRRHH WGDTTVDMGF GGGRNADPND ERWRGRVNTD NAGTHSHGLM
IGEVGGSQAH ENRPPFYALA FIMKV
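
The relationship between the two sequence blocks can be illustrated with a short script. This is a minimal sketch, not the tool that produced the record. It builds the codon table from the usual TCAG ordering, which gives the same amino acid assignments for translation table 11 as for the standard code, and it does not handle the alternative start codons that table 11 allows (not needed here, since the gene begins with ATG). The helper names `clean`, `translate`, and `gc_fraction` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First three and last two 10-base groups of the gene sequence above.
first_block = "ATGACAAATC CATTTGCGCA AGCATTAACT"
last_block = "TTCATTATGA AAGTTTAA"
print(translate(first_block))  # MTNPFAQALT
print(translate(last_block))   # FIMKV
```

The two outputs match the first ten and last five residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would extend the same check to the whole record.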