Gene Haur_1878 details


Gene Information

Locus tag: Haur_1878
Symbol:
ID: 5733767
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2237957
End bp: 2239396
Gene Length: 1440 bp
Protein Length: 479 aa
Translation table: 11
GC content: 49%
IMG OID: 641279022
Product: condensation domain-containing protein
Protein accession: YP_001544649
Protein GI: 159898402
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID:
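
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the record above; variable names are hypothetical.
start_bp = 2237957
end_bp = 2239396

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # 1440 479
```

Both results match the Gene Length and Protein Length fields listed in the record.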


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGTTG AGAATTTAGA AGATATCTAT GGTTTGTCGC CTATGCAAAA GGGGATGCTT 
TTTCATACCC TGCTTGTACC CAATTCGAGT GCCTACCAAA ATCAATCAGT TTGGCAACTT
GAGGGCCAGC TGAATCTGGC AGCCTTTGAA TGGGCTTGGC AACAGGTTAT CCAGCGCCAT
TCAGTGCTGC GGACTGCCTT CTTTTGGGAA GATCTTGAGG AGCCGCTGCA AGTGGTGTTG
CGCCAAGTTA CGCTACCATT GCAGATCATG GATTGGCGGG AATACTCGGC TGAGCAGCAA
GCCCAACAAT TGGAGCCTTG GCTAGCGGCA GATTTAGCCC AAGGCTTTCA ATTAACTGCC
GCACCCTTGC TACGCTTGAG CTTGATTCAA ATCGGCCCGA CCAGCTATTG GTTTTGTTGG
AGCCGCCATC ATCTCTTGCT TGATGGTTGG TCGCAGGCGA TTGTGCTGAA GGATTTGTTT
ACCTTCTATG AGGTCTATTG CTATGGCGAG CAAGCCAGCT TTAATCAGCA GGCAGTTTTA
GGCCCACGCC GCCCGTATGG CGAATACATT GCTTGGCTTC AACAGCAGGA TCAGGCCAAG
GCGGAGGCTT TTTGGCAACA ACTGTTGAAT AATTGGGCGG GGCCAGCGCG ACTTAGTTTT
GCTCGTCAAG GTCGTTCGCA GCACAGTTAT GCTACCCAGC TGCTCCAGCT GGCAAGCGAA
TTAACTAGCC AAGTTCAAAT GGTGATGCAG CAGGCCGAAT TAACCATCAA TAGCTTAATT
CAGGGTGTGT GGGTTTGTCT CTTAGGTCAA TATAGCAATC AACATGATGT ACTGTTTGGG
GTGACGGTTT CAGGCCGTCC GCCGAGCCTG CCAGCAATTG AAAGCATGGT TGGTTTGTTT
ATCAATACGC TGCCGTTGCG AGCTCAAATT CAGCCTGAGC AGCTGTTTCT CGATTTGCTC
AAACAGGTGC AAAGCCAGCA GTTGGCCATG AGCCAATATG AGTATAGCTC GTTGGTTGAT
ATTCAAGGCT GGGCCAAATT GCCGCGTGAA CAAGCCATGT TCGAGACAGT CGTCGTGTTT
GAAAATTACC CGATGGATAC GGCGGCTTTT ACCCAGCACT CAAGCCTTAA GCTCGATTTG
CAGCGCACCT TTGTGCAAAA TAGCATGCCA TTGACCTTAC GGGCAATCCC AGGCGATCAG
CTAACCCTCG ATGTGCTCTA CGATACTGAG CGTTTTACCG TAACCCAGAT CGAACGAGTA
TTGCACGATT GTCAGTTGGT GTTGCAAGCC ATTGCTGCCA CGCCAATGAT TGCCGTTGCT
GAGATTATGC ACCATTTACA ACAAGCTGAA GACCAATTTC AACAATTGGA AGAGCAACGA
TTAAGAGATG CTAATGCTCA GAAACTCAAA ACGATCAAGC GCCGTTCGGT CGTTTCATAA
 
Protein sequence
MKVENLEDIY GLSPMQKGML FHTLLVPNSS AYQNQSVWQL EGQLNLAAFE WAWQQVIQRH 
SVLRTAFFWE DLEEPLQVVL RQVTLPLQIM DWREYSAEQQ AQQLEPWLAA DLAQGFQLTA
APLLRLSLIQ IGPTSYWFCW SRHHLLLDGW SQAIVLKDLF TFYEVYCYGE QASFNQQAVL
GPRRPYGEYI AWLQQQDQAK AEAFWQQLLN NWAGPARLSF ARQGRSQHSY ATQLLQLASE
LTSQVQMVMQ QAELTINSLI QGVWVCLLGQ YSNQHDVLFG VTVSGRPPSL PAIESMVGLF
INTLPLRAQI QPEQLFLDLL KQVQSQQLAM SQYEYSSLVD IQGWAKLPRE QAMFETVVVF
ENYPMDTAAF TQHSSLKLDL QRTFVQNSMP LTLRAIPGDQ LTLDVLYDTE RFTVTQIERV
LHDCQLVLQA IAATPMIAVA EIMHHLQQAE DQFQQLEEQR LRDANAQKLK TIKRRSVVS
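
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the database's own method: the helper names `translate` and `gc_percent` are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares for internal codons. It strips the spaces used to group the sequence into blocks of ten, then translates codon by codon until the first stop.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses these same assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(dna):
    """Remove the whitespace used for block formatting and uppercase the bases."""
    return "".join(dna.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(seq.count(base) for base in "GC") / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "ATGAAAGTTG AGAATTTAGA AGATATCTAT GGTTTGTCGC CTATGCAAAA GGGGATGCTT"
print(translate(first_row))  # MKVENLEDIYGLSPMQKGML
```

The printed residues match the first twenty residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein Length and GC content fields to be checked in the same way.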