Gene Haur_3127 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3127 
Symbol 
ID5734999 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3947286 
End bp3949175 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content52% 
IMG OID641280270 
Productamino acid adenylation domain-containing protein 
Protein accessionYP_001545892 
Protein GI159899645 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAATAG CGACATTAGC GCCGATCACC GCCGACCGCA CAGGCACAGT TTTTGTGCAT 
GATGTTATCA GCACGCACGC CCAGCATTCG CCACAGGCTA TTGCCATTGC TACCAGCACA
TTCAAGTTGA GTTACGCTGA ATTCGAACAG CGCACCAATC AGCTTGCCCA TTATTTACAC
CGCCAAGGAG TGCATCGTGG CCACACAGTT GGCGCGTGCT TTGAGCGCTC AGTCGAGGCA
ATGATCGCCG CCGTGGCGAT TTGGAAGGCT GGCGCAGTCT ACTTGCCGCT TGATCCAGGC
TATCCCCAAG AACGGCTTAA ATATATGTTG GGCAATAGTG GCGCGAGTTT GGTACTGGCA
ACCCAACTCA CCGCCAGCCA GTTTCCTGAG CAACAATTGC ATATTTTTGA GCAATTAGCC
GCTGAGCTTG CACAGCAACC AAGCCATGCC CCTGAACATC AACTTACACC CGACGATCTG
GCCTATATCA TTTATACCTC TGGCTCAACT GGCAAGCCCA AAGGCGTGCT CGTGCCGCAT
CGCGGTTTGG CCAATTTGGC GGCTGCTCAA ACCGAGCGCT TTGGCATCAA CAGCCAATCA
CGAATTTTGC AGTTTGCCTC GCCAAGCTTC GATGCTTCGA TTTCTGAGAT GCTGACGGCT
TTTTTTCAGG CTACAACTTT GTTTGTAGCC CCAACGAACG ATCTTTTGCC CGGCCCAGAC
TTATTGACAA CCCTGCGTGA TCACCACATC ACGGTTGCCA CCCTGCCGCC TTCGGTGCTG
GCGTTGCTCG ATCCACGCGA CTTGCCCAAT TTACAAACCA TCGTTTCGGC AGGCGAGGCT
TGCACCGCTG AAATTGTGGC CCGTTGGGGG ACAAACCGCC GTTTTATCAA TGCCTATGGC
CCGACCGAAG TCACCGTTTG CGCCACCATG AGCCAGTCTT TACGCTACGG TATGGCCGTC
AGCATTGGCA ACGCTATCAG CAATAGCCAA ACCTATATCG TTGATGAGCA CTTAAATTTG
GTCGAAGGCG AGGCGGTTGG CGAATTATTG GTCAGCAGTG TTGGCTTAGC CCATGGCTAC
CTTGGCCTTG GCGATCAAAC TGCCGAGCGC TTTTTGCCCA ATCCATGGAG CGACCAAGCG
GGCAGTCGGA TGTATCGCAC TGGCGATTTG GTGCGGCGCT TGAGCGATGG CAGCCTTGAG
TTTCGCGGAC GCATTGATCA TCAAATTAAG CATCGCGGCT ATCGCATCGA TCCAGGCGAA
ATTGAAATGC TCTTGATGGA ATATCCTAAT GTGCGCCACG CCGTCGTCAC GCTGCATCAC
GACCATAACC AGACCGAGCG GTTGGTCTCG TATTTGGTGT TACACGGCGA AGTTATGCCC
TACTATCGCG ATATTTACCG CTATTTGGAG AGCATGCTGC CCAAATATAT GGTACCGCTC
TCGTATACGG TCGTGAAAGA ATTGCCACGC ACGCCCAATG GCAAACTCGA TTTAGCAGCC
TTGCCTGAGC CAGATTTTGC CTTGCTGACC GTTAGTGAAA ATTATGTTGC CCCACGCACA
CCGCTTGAAC AGCAGATCGC CGCGATTTGG GAAAATATTT TGGATACTCC CAATATTGGT
GTGCTTGATG ATTTCTTTGA TGCTGGCGGC CACTCGCTGT TAGCAACCCA AATTGTTTCA
GCGATTCGCA GCACGTTTGC GGTGGAAATT CCACTTTCAG TCTTGCTTGG CGTTGAGCCA
ACCATCGCCG CCACGGCCCA ATTGATCGAG CAATATCAAA TTGCTAATGC CGATGATGCC
GAACTCGCCG ACCTGTTGAA CGAACTTGAT GGCCTCTCCG ATGAAGAAAT TCAAGCCTTG
TTAGCTGATG AAGGAGCGCT CACCGCATGA
 
Protein sequence
MSIATLAPIT ADRTGTVFVH DVISTHAQHS PQAIAIATST FKLSYAEFEQ RTNQLAHYLH 
RQGVHRGHTV GACFERSVEA MIAAVAIWKA GAVYLPLDPG YPQERLKYML GNSGASLVLA
TQLTASQFPE QQLHIFEQLA AELAQQPSHA PEHQLTPDDL AYIIYTSGST GKPKGVLVPH
RGLANLAAAQ TERFGINSQS RILQFASPSF DASISEMLTA FFQATTLFVA PTNDLLPGPD
LLTTLRDHHI TVATLPPSVL ALLDPRDLPN LQTIVSAGEA CTAEIVARWG TNRRFINAYG
PTEVTVCATM SQSLRYGMAV SIGNAISNSQ TYIVDEHLNL VEGEAVGELL VSSVGLAHGY
LGLGDQTAER FLPNPWSDQA GSRMYRTGDL VRRLSDGSLE FRGRIDHQIK HRGYRIDPGE
IEMLLMEYPN VRHAVVTLHH DHNQTERLVS YLVLHGEVMP YYRDIYRYLE SMLPKYMVPL
SYTVVKELPR TPNGKLDLAA LPEPDFALLT VSENYVAPRT PLEQQIAAIW ENILDTPNIG
VLDDFFDAGG HSLLATQIVS AIRSTFAVEI PLSVLLGVEP TIAATAQLIE QYQIANADDA
ELADLLNELD GLSDEEIQAL LADEGALTA