Gene Haur_3707 details

Gene Information

Locus tag: Haur_3707
Symbol:
ID: 5735571
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4660912
End bp: 4662468
Gene Length: 1557 bp
Protein Length: 518 aa
Translation table: 11
GC content: 52%
IMG OID: 641280859
Product: polymorphic outer membrane protein
Protein accession: YP_001546471
Protein GI: 159900224
COG category:
COG ID:
TIGRFAM ID: [TIGR01376] Chlamydial polymorphic outer membrane protein repeat
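
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4660912, 4662468)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1557 518
```

Both values agree with the Gene Length and Protein Length fields of the record.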


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000878669
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTATTTT TGATTGGGCG CATGCTCTTG GTGCTTGGTT TGTTGGCTGG CGTATGGCCC 
AAGCCCAGCC AAGCCAGCGC GATCTTGATC GATTGCAATA CGACAGCAGT TAGCGCGGCG
ATTAGTGCTG GCGGCGCGAT CAACATCAAT TGCAACAGCC CCACCGTCTT GACGTTTGGC
AGCGAATTGG TGATGAACCA AGCAACTGAG ATTAATGGCA ATAACAATGC AATTTTCGAT
GGCCAGAATA CGCGGCGGTT GCTGCGCAGC GCCAATAATC TCAAAATCAC CTTGCGCAAC
CTCACGATTC GCAATGGCCG TACAACTGAC CAAGGCGCAG GCATCAAAGT TGGCTTTTGG
AATACGCTCA GCATAAGCAA TGTGCGTTTT GAAAACAACC AAGCAACCAA AGATACGGCT
GCTTGCGATG GTGGCGGGGC AATTTTTATC GGCGGTGGCA GCAGCGCCAT GATCGATCAC
AGCACCTTCA TCAACAATCG CGCCAATAAT GGCGGGGCAA TTAACAGTTT GCGCACCAAT
TTGATGGTCA GCAATAGCAG CTTCGAGCAA AATAGCGCCA TGCATACCGC AGCCATCAAT
CAACATGGCG ATTGTGGTGG CGGCGGGGCA ATCTACATCG ACGGCACGCG CATGCCCAAC
GATGGCGGCC CCGATGGCAT TGTGCTGCTC AATAATAGCT ATACCAGCAA CACCAGCAAC
AACCATGGTG GCGCAATCTT CATCGGTTTG TATAGCAACG AAACCGCCAC AATCGAGCGC
TCAAGTTTTA CCAACAACAG CGTAACTTAC GCAACATCAG CCGATTGGTC GGGTACAGGC
GGGGCCATTT GGTATGGCTT TGCTGCTGGT GGCGTGACTA ACGAACGGCT TTTTATCAAC
AACTCAACCC TCGAAGGCAA TAAAGCCATC GGCCAAGGCG GCGGTTTGTG GGTTGATGCA
CCTGCCACAA TTCGCAACAC CACCTTTTAT GCCAACGATG CCACCGATCC GCGCAGCTAT
CCCGATGATC AAGAATGGCG CAAAGGTAAC GGCGGGGCGT TGGCGGTCAA TAATAATGCA
GCCGTTGATA TTACCAATGC AACCTTTATG AACAACCATG CTGGCTTCAA CGGCGGAGCA
ATTGCAGGCC AAACCATCAC AATCCGCAAC ACGCTGTTCT ACAATAACAC CACCGATTGG
TCGATTAAGA TTATGCAGCA TTGCACCAAT GCGCTAATCG ATGGCGGCAA TAATCTGCAA
TACCCGCCTA AAAATCCTAA CCCTAATTAT TGGAACGAAA CCAACTGTAC CAGCAGCATG
CGCACCCCAG AACTAACACT GAGCAACATT GCCAATAACG GTGGCTCCAC CCGCACTGCC
GCTTTACCTG CTGGTAGTCC CGCGATCAAT CTGGGCAATC CCGCCAGTTG TAGTGCGCTC
GATCAGCGCG GCTATACGCG GGCTGGGGCT TGTGATGTTG GAGCATATGA ATACAACGGC
GCAGCATTTA GCCCAAGCCA TAGCATTTAT ATTCCGTTAG CCCGCAAACC CAATTAA
 
Protein sequence
MVFLIGRMLL VLGLLAGVWP KPSQASAILI DCNTTAVSAA ISAGGAININ CNSPTVLTFG 
SELVMNQATE INGNNNAIFD GQNTRRLLRS ANNLKITLRN LTIRNGRTTD QGAGIKVGFW
NTLSISNVRF ENNQATKDTA ACDGGGAIFI GGGSSAMIDH STFINNRANN GGAINSLRTN
LMVSNSSFEQ NSAMHTAAIN QHGDCGGGGA IYIDGTRMPN DGGPDGIVLL NNSYTSNTSN
NHGGAIFIGL YSNETATIER SSFTNNSVTY ATSADWSGTG GAIWYGFAAG GVTNERLFIN
NSTLEGNKAI GQGGGLWVDA PATIRNTTFY ANDATDPRSY PDDQEWRKGN GGALAVNNNA
AVDITNATFM NNHAGFNGGA IAGQTITIRN TLFYNNTTDW SIKIMQHCTN ALIDGGNNLQ
YPPKNPNPNY WNETNCTSSM RTPELTLSNI ANNGGSTRTA ALPAGSPAIN LGNPASCSAL
DQRGYTRAGA CDVGAYEYNG AAFSPSHSIY IPLARKPN
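
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch, not the database's own tooling: it strips the whitespace, computes GC content, and translates DNA to protein. It assumes that the amino-acid assignments of translation table 11 match the standard genetic code (the two differ in the start codons they permit), and all function names are hypothetical.

```python
BASES = "TCAG"
# Standard-code amino acids in TCAG codon order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = clean_sequence("ATGGTATTTT TGATTGGGCG CATGCTCTTG")
print(translate(first_blocks))  # MVFLIGRMLL
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence through `clean_sequence` and `gc_content` would give a value to compare against the GC content field.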