Gene Haur_5240 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5240 
Symbol 
ID5737198 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009974 
Strand
Start bp6308 
End bp8647 
Gene Length2340 bp 
Protein Length779 aa 
Translation table11 
GC content59% 
IMG OID641282404 
ProductType IV secretory pathway VirB4 protein-like 
Protein accessionYP_001547995 
Protein GI159901750 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3451] Type IV secretory pathway, VirB4 components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCAGC AGCCGCTGTG TTTGACGATT GACCCCTTTT CGGTTCGCCA GTACCGTGAG 
GAATTGCCCC AGCTGGAGCA GCGCTTTGCC AACTTTTGGG CAGGCATCAC CTACGATGCA
CGGCTGATCT CGTGTACGCG GCGCTTTTCG TTTGCGCCGA TTCGCCAGCG CTTGCGCCAA
CAAACCAGCC CGCTTGACGA CCTGCGCGAT CTCATGCCCA TGCTGGTGTC AGCGCAGGAG
GATGGAAGCC GAAACGCGGC ATTAACCAGC TTGGTCCAGA AGCGGCTCGC AACCTATGAA
CGGGCGACTG CGGCATTACA GGATGCGCCA GCGGTGCATG CTGCTTTCCG CGCGATGGCC
ACGGGTGCGA GCGATGCCAC GACGCTGGCC GTGGTCGCCG ATGGCTGCCG CCGTGCCCTT
TGGCCGTGGC GGTGGCTCAA GAATTATCGG CGGGCGTATG AGGTCATGGA GCGCGAAGGC
AATCCCCTTG GCATTCAGCA TTACTTTGTG GCCTGGCCGT CCGAGTATAC GGATGCCGAG
GCGGTGCGCA GTGTGCTTAA GGGCACGTTC TTATTGCCCG ATGTCCAGAG CGCACCGCTA
CCACCCTTGT TTCATGGGAA ATACCGCGAA ATGGCGACCT ACTTGACCCC GCTCGACGAG
GGTCGTCCCT ACTTAAGGGT GATTCACGCC TTTGATGTGC GGGGTGAGTG GGATTTGGGC
AGTATGCAGG AGCTTTTAGG CGGGGAAGAA GAACTGGCCG TCGCGCTGGA TGTCACCACC
TTGCCCCGCG CCAAGGCCCA ACGCGCGACG ACCGATGCCT TTAATGTGCT TGAGGGTGCG
TTGACGGCGC GGAATGCCGT GAAGGACTCG CGCAGCGAAC GCGCCTACCG CGATGTCAGC
TACGCCATGA ACCAACTCGA TGTCCAGCAG TTGCACGAGG TGGCCTATGC GATCCTGATT
CAAGCACCGA GCGTGCGGGA CTTGAATCGC ATCACCCAAA CCCTGCGTGA CCGCATGGGC
GCACGGCTCA AACTGGATGT GCTGAGCGGA ACCCAAGGCG AATACCTCAA ATTGTTCACG
ACCACGCCGA GCAAGCAGAT TGCCGCGCCG CTCATCCGGC GGAATGCCTT GAGTGAACAC
GTCGCGGCCA AAACGCCCTG GGGGATTCGC AAGAGCGATG CGACCAGCGG GGTGCTGTTT
GGCTACGATC CCCACGACCA ATTGCCGTAC CACTATAACC TCTTTGGCGC GACCGGAACC
GACAACCCGC ACTTGCTGAT GCTCGGCAAA TCGGGCAGCG GCAAAACCGT GAGTTTAGGC
ATGCTAGCGT TGCGCCATGC CGTCGCAGGC CACCAGATCG TGATGTTCGA TCCGGTCGGG
AATTGTGCGC GGCTGTGTGA GGCGGTCGGC GGTGGGGCGG CCTACTATCA CTTGGCCGAG
GACGTGGCGA TTAATGTGCT GGATCCAATG GAAACCAGCT TGCATCGCCA GAAAAGCCAT
GTGGAGCGCA AGTTGTCCAT GGTCTTAGGC CGCGCGATCA CCAGTGGCTC TGGTGTCCAG
TTGCGCCCGC GTGAGTTCAG TAACGCCGAA CGGGGAGCGC TCGATGCCGC CCTCGCATCG
ACGCGGATCT ATGGCCCCGA TGGGGTGTTT TTGGCGCAGA TGGATGACGA CACCGCGCCG
CTCTTGAGCG ATCTGGTGCT GGCCTTGCGC GAAACCAAGC GCCCCGTGGG GCAGGCATTG
GCTGAGGAGA TCACCGATAT TGCCTTGCAA TCGCAAGCCC ACTTGTTTGA TCGCCAAACG
ACCTTGAAGT GGGATTTTGG CAGCGATGTG GTGGCGTACA ACCTCAACAA CGCCGATAAA
GCCTTGCTCC CCTTGTACCT TGATCATGGC ATCGGGGCAC TCAACCACTA CATTCGCAGC
CCTGAACGAC GAGCGCGAGG CCAAAAGCTG GTCTGTGTGG TGGACGAGTT CGGCATTCTT
TCACAAATTG AAAGCCTGAA AAAAGAGGTC GCTAATGCGA CGAAAGAATG GCGGAATTAC
GGTGCAGCGC TTTGGTCGTG TGATCAGAAT TCGGCGACGT ACATGGGTGG TAGCGGCAAT
GCCCAAGACT TCAATAACCT GACGACCAAC AACACCGCCG TGAAGCTGTT TGGGCGGCAA
GAGGGAACCG ACGCGAACCT GCTTGGTGAG GCGTTTCCGG AGTTATCCCC CAGCGATATT
GCAGCCATCA GAACGGCAGG CCCAGGCGAG TTTGTGGGCA TTTTCGGCAC AAACGAAGTC
CACCATCTGC GCATGCAATT GACCGATCAA GAAGTGGCCC ACTTTATTCG GAAGGGTTAA
 
Protein sequence
MNQQPLCLTI DPFSVRQYRE ELPQLEQRFA NFWAGITYDA RLISCTRRFS FAPIRQRLRQ 
QTSPLDDLRD LMPMLVSAQE DGSRNAALTS LVQKRLATYE RATAALQDAP AVHAAFRAMA
TGASDATTLA VVADGCRRAL WPWRWLKNYR RAYEVMEREG NPLGIQHYFV AWPSEYTDAE
AVRSVLKGTF LLPDVQSAPL PPLFHGKYRE MATYLTPLDE GRPYLRVIHA FDVRGEWDLG
SMQELLGGEE ELAVALDVTT LPRAKAQRAT TDAFNVLEGA LTARNAVKDS RSERAYRDVS
YAMNQLDVQQ LHEVAYAILI QAPSVRDLNR ITQTLRDRMG ARLKLDVLSG TQGEYLKLFT
TTPSKQIAAP LIRRNALSEH VAAKTPWGIR KSDATSGVLF GYDPHDQLPY HYNLFGATGT
DNPHLLMLGK SGSGKTVSLG MLALRHAVAG HQIVMFDPVG NCARLCEAVG GGAAYYHLAE
DVAINVLDPM ETSLHRQKSH VERKLSMVLG RAITSGSGVQ LRPREFSNAE RGALDAALAS
TRIYGPDGVF LAQMDDDTAP LLSDLVLALR ETKRPVGQAL AEEITDIALQ SQAHLFDRQT
TLKWDFGSDV VAYNLNNADK ALLPLYLDHG IGALNHYIRS PERRARGQKL VCVVDEFGIL
SQIESLKKEV ANATKEWRNY GAALWSCDQN SATYMGGSGN AQDFNNLTTN NTAVKLFGRQ
EGTDANLLGE AFPELSPSDI AAIRTAGPGE FVGIFGTNEV HHLRMQLTDQ EVAHFIRKG