Gene Cag_1408 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1408 
Symbol 
ID3747167 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1874297 
End bp1876192 
Gene Length1896 bp 
Protein Length631 aa 
Translation table11 
GC content49% 
IMG OID637773944 
Productmembrane-fusion protein-like 
Protein accessionYP_379709 
Protein GI78189371 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0845] Membrane-fusion protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.382309 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGAAC ATATGAATGC AAATCTTGAT GCAAGTTTAA GTGCCTTACG TGCTCTTCGC 
GATTTTAAGG GGAGGGAGGG TGAATTTTGG CTCTCTATGG CAACCCACAT TTCTCGCCTT
TTTCAAGCGG AACGTGTGGT GCTGCTGCGT CGTGCAGAGG TAGGATGGAA CCCGTTAAGC
TTTTGGCCGC TTGCTTCGGC TCGTTCAGCG ATTCCCCCAT CAGCACAGCT TGCTGAGCTT
GCAACTACCG CCAAGCAAAC GGCTGTTGCC TATGCTCCGC TGCCTGAGCA ACTTGGCACA
ACATTGCAAC ATGCGGTTTC AACGGCAGCC GAACAGTCGT CAGAAAAAAA AGCTGACAAC
ATATCAGGCA ACACGTTGCT TGCATTTCAT CTCCATACCG AAAGTGCTGA TGGTGAAATA
ATGGCGCTCT TATGGCGAGC GCACGATAGT GCTGCATTGC GCAATGGCGA TCTGCTTAAA
CGAGAGCTGT TAGTTGACCT TCCATTGCAA TATCGTCAAA GCAACACAAC TCGTCCACTT
ACCTCCGCTA CGCCTGAATC ACTTGATGTT GTGCTCAGTA TGAATGAGCA CACAACTTTT
ACGGGTGCGG CAATGTCGCT CTGTAATGAG CTTGCATTTC GTTTAAGCTG TTCGCGTGTA
AGCATTGGGT GGAAAGATGG TGAGTATATT CGTTTGCAAG CGGTAAGCCA CACCGAAAAG
TTCGACCGTA AAATGAGCGT GGCGCGAGCG TTGGAAGTGG TAATGGAGGA GTGTTTTGAT
CAAGATGAAG AGCTGCTTGT GCCCGAAGTA GCAGGCACCT CCACCACCAT TATTCGAGAG
CATCGGGCAT TTGTGGCAAA GCAAGGGGTG GGTGCCATCC TTTCGCTGCC GTTGCGGCTT
GGCAATGAGG TGGTTGCGGT GTTAAGCTGC GAGCGCGACA AACCCTTTAG TGCCGACGAC
ATTCGTAGCC TTCGCATTAT TTGCGACCAA GTAACGCGCC GACTTGGCGA CCTCAAGCAT
TTTGACCGTT GGTTTGGCGC TGTAGCGCTT GATAAGGTAC GCAATTGGGC ATCATCCCTT
ATTGGTACCG ATAAAACCGT GCATAAAATT TATGCCGTAG TGGGCAGTAT CCTCCTTCTC
TTCTTGCTTT TTGGCAAAAT GGAGTACAAG GTAGAAGCTC CCTTTATTCT TCGTACTCAC
GATCTTGCCC TTCTTTCAGC TCCTTTTGAT GGCTACATCG AACGTGTAAG CCGCAAGCCG
GGCGATTTGG TAACAACGGG CGATCCACTT ATCCTCCTCG ATACTCGCCA ACTTTTGCTT
GAAGAATCGC GTTCTGCTGC CGATGTGTTG CGTTATCAAC AGGAGGAGAA AAAAGCTATG
GCTCAAAATG CCTTAGCCGA AATGAAGGTT GCCGAAGCCT TGCGCCGCCA AGCCGACAGC
CGTTACCAAA TGATTCGCTA CAACTTACAG CACGCTGATA TTCGAGCACC CTTTAGCGGT
ATAGTGGTTG AGGGCGATCT TGAAAAGTTG CTTGGAGCGC CCGTGCGTAA GGGTGATGTG
CTCTTAAAAG TAGCCAAGCT TGAAAAGCTC TACATTGAAA TTAAAGTGGC AGAGCGCGAC
ATTCAAGAAT TTAAGGTAGG GCAAGAGGGC GAAGTGGCGT TTATAAGTCA GCCAAGCAAA
AAATATACCG TTGTTGTTGA TCGCATTGAA CCGATGGCGG TAACCGAGCA AAAAGGCAAC
GTCTTTTTAG TATTGGGGCA TATTACCGAA GCGCGTGATG CGTGGTGGCG CCCCGGCATG
AGCGGTTTAG CAAAAGTAAG TGTTGGAGAG CGCCACATTT TGTGGATTTG GCTCCACCGC
ACGCTTGATT TCTTCTCCAT GAAACTTTGG TGGTAA
 
Protein sequence
MDEHMNANLD ASLSALRALR DFKGREGEFW LSMATHISRL FQAERVVLLR RAEVGWNPLS 
FWPLASARSA IPPSAQLAEL ATTAKQTAVA YAPLPEQLGT TLQHAVSTAA EQSSEKKADN
ISGNTLLAFH LHTESADGEI MALLWRAHDS AALRNGDLLK RELLVDLPLQ YRQSNTTRPL
TSATPESLDV VLSMNEHTTF TGAAMSLCNE LAFRLSCSRV SIGWKDGEYI RLQAVSHTEK
FDRKMSVARA LEVVMEECFD QDEELLVPEV AGTSTTIIRE HRAFVAKQGV GAILSLPLRL
GNEVVAVLSC ERDKPFSADD IRSLRIICDQ VTRRLGDLKH FDRWFGAVAL DKVRNWASSL
IGTDKTVHKI YAVVGSILLL FLLFGKMEYK VEAPFILRTH DLALLSAPFD GYIERVSRKP
GDLVTTGDPL ILLDTRQLLL EESRSAADVL RYQQEEKKAM AQNALAEMKV AEALRRQADS
RYQMIRYNLQ HADIRAPFSG IVVEGDLEKL LGAPVRKGDV LLKVAKLEKL YIEIKVAERD
IQEFKVGQEG EVAFISQPSK KYTVVVDRIE PMAVTEQKGN VFLVLGHITE ARDAWWRPGM
SGLAKVSVGE RHILWIWLHR TLDFFSMKLW W