Gene Haur_4229 details

Gene Information

Locus tag: Haur_4229
Symbol:
ID: 5736083
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5390876
End bp: 5392129
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 46%
IMG OID: 641281384
Product: phospholipase D/transphosphatidylase
Protein accession: YP_001546989
Protein GI: 159900742
COG category: [I] Lipid transport and metabolism
COG ID: [COG1502] Phosphatidylserine/phosphatidylglycerophosphate/cardiolipin synthases and related enzymes
TIGRFAM ID:
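
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5390876
end_bp = 5392129
gene_length_bp = 1254
protein_length_aa = 417

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with one stop codon,
# which does not contribute a residue to the protein.
codons = gene_length_bp // 3
expected_aa = codons - 1

print(span_bp)      # 1254
print(codons)       # 418
print(expected_aa)  # 417
```

Under these assumptions the span matches the listed gene length, and 418 codons minus one stop codon gives the listed 417 aa.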


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTTGGC TCAAAGCTGG GTTATGGTTT CTGACTGAGT GGGTGCTTTC GCTGCCACGT 
TTGCTGCTGG CCAGCATTCT GGCTCAATTG GGCTTGATGG GTGCGCTGAT GGCTTACGCT
AAGTTGCGCA ATCGCAGCGT CGAAAATAGC TTTCCCCAGC TTGATTTGCC GCCAATCGAT
TTGGGCGAGC ATAGCGTGCA ATTATTTGAT GATGGTGGCT CAGTCTTTCG CCAAATGCTG
TTTGATATTT CCCTAGCCAA AGAATCGATT TTGCTCGAAA GCTATATTTT CGAGCGTGAT
GAGGTTGGGC TAGCCTTCAA ACGGGCATTA ATTCGCAAAG CCCGCGAAGG TGTGAAAATT
TATGTAACCT TCGATGGCAT TGGCACGTTG CATGTGCCCG AACGCTTTAA GTTATTTGGC
AAGCGCCGCA ATATTCGGGT CTTTGAATTT GGCCGCCTCA AATCAATTCG CTCATTTTTC
GATACGAACA TGTGGATTCG CACCCATCGC AAAATTTTGG TGATCGATGG CAAAATTGGC
TATTTAGGCG GCATGAATAT TGGCCGCAAC TATGCCCGAA CTTGGCGTGA TACTCATTTA
AAAATTGAAG GCCCAATCGC AGCCAATATT GCCGAAGAAT TTATCGCTCT GTGGAATAAA
TACAATAAAA AACATAAAAT CAATTTAAGT TACGCAACTA CTAGCGATCA ACAAGTGGGA
ATTTGTGCCA ACGACCCAAT TGAATATCAA CTGCCGATTC GCCAAACCTA TCTTGATGCA
ATTAATGATG CCAAACACTA TATTTACATT TCAAATGCCT ATTTCTTGCC CGACCCGCTG
ATTCGAGCAA CGCTGATTGA TGCAGCCAAA CGCGGGGTCG ATATTCAAAT TATGGTGCCC
GAAATCTCCG ATAATATCGT AGTCGATTGG ATTTGTCGCG GTTTATTGGG AGAATTAATG
GAGCATGGCG CACGGGTGCT GCTCTATCGT GGCACCATGA TTCACGCCAA AACCATGACC
GTTGATGGGT TGTGGTCAAC CGTGGGTAGC GCTAATTTAG ATACCCGCAG TTTGGCCGCG
AATTACGAAA TCAACGCCAC GATCAAGCAT CCAGCTTTTG CCCGCCAAAT GGAAGCCATG
TTTTTGAATG ATCGAGCGCA ATCGCGTGAA ATTAATTATC AAACCTGGCG TAGTCGTTCG
ATTTTTATTC ACCTCGGCGA GCAAGTGCTG CAACCAGCGC GAGGCCTGTT TTAG
 
Protein sequence
MRWLKAGLWF LTEWVLSLPR LLLASILAQL GLMGALMAYA KLRNRSVENS FPQLDLPPID 
LGEHSVQLFD DGGSVFRQML FDISLAKESI LLESYIFERD EVGLAFKRAL IRKAREGVKI
YVTFDGIGTL HVPERFKLFG KRRNIRVFEF GRLKSIRSFF DTNMWIRTHR KILVIDGKIG
YLGGMNIGRN YARTWRDTHL KIEGPIAANI AEEFIALWNK YNKKHKINLS YATTSDQQVG
ICANDPIEYQ LPIRQTYLDA INDAKHYIYI SNAYFLPDPL IRATLIDAAK RGVDIQIMVP
EISDNIVVDW ICRGLLGELM EHGARVLLYR GTMIHAKTMT VDGLWSTVGS ANLDTRSLAA
NYEINATIKH PAFARQMEAM FLNDRAQSRE INYQTWRSRS IFIHLGEQVL QPARGLF
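
The relationship between the gene sequence and the protein sequence above can be reproduced with a short script. This is a sketch rather than the database's own method: it assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons), and the helper names `clean`, `translate`, and `gc_percent` are hypothetical. The sequences are written in space-separated blocks of ten, so whitespace is stripped first.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (third base varies fastest).
# Assumption: standard assignments, which translation table 11 also uses.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence above.
first_blocks = "ATGCGTTGGC TCAAAGCTGG GTTATGGTTT"
print(translate(first_blocks))  # MRWLKAGLWF
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the listed protein length and GC content to be checked in the same way.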