Gene Haur_3167 details


Gene Information

Locus tag: Haur_3167
Symbol:
ID: 5735039
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3999819
End bp: 4000739
Gene Length: 921 bp
Protein Length: 306 aa
Translation table: 11
GC content: 50%
IMG OID: 641280310
Product: cytochrome c biogenesis protein transmembrane region
Protein accession: YP_001545932
Protein GI: 159899685
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0785] Cytochrome c biogenesis protein
TIGRFAM ID:
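
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a stop codon. The record does not state either convention explicitly, and the function names are illustrative only.

```python
# Values copied from the record; the helper names are hypothetical.
START_BP = 3999819
END_BP = 4000739


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 921
print(protein_length(length_bp))  # 306
```

Under those assumptions the computed values agree with the Gene Length (921 bp) and Protein Length (306 aa) fields.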


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00198959
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAACAAG CGCAAACACA GCGCCAACCC TCCCGACTGT TTACCATGGC TATCATTGGG 
CTAGCCGTGT TTGTGGTAGC CGCAATTTTG CTGATTGGGG TCGCTCAAGG TGGACAAAAT
GTGATCACTT TGGCGGTTCC AGCCTTTATG GCGGGAGTTT TATCGTTTTT ATCGCCCTGT
ACCTTGCCCG TGCTACCAGC CTATTTCGCT TGGACGTTTG GAATCAATAG TGCTGGTGAT
GAAAGCCTGG CGCAAAAACG CCAACGTACA ATTATCTCAT CGTTGGCCTT TTTTGCGGGC
TTGGCAACCA CGATGGTGGT GTTGGGTATG GCGATTACGG CCACTAGCCA AGCCATTCGC
GGCATTTCAT CCAACAAAGA TTTGTTCGCC CAAATTGGCG GGGTAATTAT CATTGTCATG
GGTGTGATGA GCATTTTTGG GATTGGCTTC ACCGGAGCGA ATATCAAACG TAGCTCCAGC
GCCACCGTTG GCAGTGCATA TCTGTATGGA TTAACCTTTG CGCTGGGCTG GACGGCTTGT
ATTGGGCCAA TTCTTGGGGC AATTTTGACC CTGTTGATTT CAACCGGAGC TTCGATTATT
GCTGGCGCAA CCTTGACCTT TATCTATGTG CTTGGCTTGG CTTTGCCCTT GATGATCGTA
GCGACCTTCT TCAACAAACT TGGCCAAGGC TCCAAGGGCT GGAAGCTTCT GCGTGGGCGG
GCTTTGGAGT TCAACATTGG CAAGCGCGTC GTGATTTTGC ACTCCACCAG CGTGATCAGC
GGTATTTTGT TGATTGCCGT GGGGATTTTG CTGGTAACTG GACAAATGAG CACATTGAAT
GATATTGCCC TCGATAACCC CATTAGCAAA TGGGCCAACA ATGTGCAATA CGATATTCAA
ACCTTTTTTA CTGGCGAGTA G
 
Protein sequence
MQQAQTQRQP SRLFTMAIIG LAVFVVAAIL LIGVAQGGQN VITLAVPAFM AGVLSFLSPC 
TLPVLPAYFA WTFGINSAGD ESLAQKRQRT IISSLAFFAG LATTMVVLGM AITATSQAIR
GISSNKDLFA QIGGVIIIVM GVMSIFGIGF TGANIKRSSS ATVGSAYLYG LTFALGWTAC
IGPILGAILT LLISTGASII AGATLTFIYV LGLALPLMIV ATFFNKLGQG SKGWKLLRGR
ALEFNIGKRV VILHSTSVIS GILLIAVGIL LVTGQMSTLN DIALDNPISK WANNVQYDIQ
TFFTGE
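
The gene and protein sequences above are printed in blocks of ten characters. The following is a minimal sketch of how such a block-formatted sequence could be cleaned, translated, and checked for GC content. It uses the standard amino acid assignments, which translation table 11 shares with table 1 for internal codons (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG). The function names are illustrative and not part of any database tooling.

```python
import itertools

# NCBI ordering of codons (T, C, A, G at each position) and the matching
# amino acids; these assignments are the same in tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence shown above.
first_line = "ATGCAACAAG CGCAAACACA GCGCCAACCC TCCCGACTGT TTACCATGGC TATCATTGGG"
last_line = "ACCTTTTTTA CTGGCGAGTA G"
print(translate(first_line))  # MQQAQTQRQPSRLFTMAIIG
print(translate(last_line))   # TFFTGE
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow the protein sequence and the GC content field to be compared against the record in the same way.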