Gene Cag_1047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1047 
Symbol 
ID3747775 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1416029 
End bp1417507 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content44% 
IMG OID637773576 
Producthypothetical protein 
Protein accessionYP_379352 
Protein GI78189014 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000670981 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTGCC CTTACTACAA CCGACCGCCA ACATCATCGT TACCATTCCC CACCGCCGTA 
GAGACAACGC ATGCGTTGTC TCTACAATCC ATAAAAAAAT TTCCTACCTT AACACTATAC
AACCTCCACG CCTTTTTCCA CGTTAATTTA TTAACGATTA TGGCTACCAA CTCTTCCTCC
CCGCTTGTGC GTCGTGAATT GTTCATTTTC GATGCTTCTG TTTCTAACCT TTCAACGCTT
TCTTCTGCAT TATCTGCTAA CAGCAGCTAT TTTGTGCTTG ATTCCACGCG CGATGGTTTG
GTGCAAATTG CTGATTTGCT TGCGGGGCAA ACGGATATTG ATTCCCTCCA TATCTTTAGC
CACGGCAGTG CTGGTTCGTT GCAGCTTGGC AACTCCTCGC TTTCGCTTGT AAATCTGAAT
AACTACGAAT TACCGTTGTC GGTGATTGGT TCATCGTTAT CATCAAGTGG TGATATTTTG
CTGTACGGTT GCAATGTTGG TGCCGGTGAT GAAGGACTTG CGTTTGTTGA TAAACTTGCG
AAAATGACGG GCGCTGATGT TGCGGCTTCG GATGACTTGA CGGGTGCAAC GGCGCTTGGT
GGGGATTGTG AGTTGGAGGT GGAGAGTGGG GTTATTGATG AGGCTTCATT TTATTATGCC
CCTGAATATG CTGGACTTCT TGGAGCTGTG GGACCAGAGT TTCATGTTGA CACCTCTGAT
ATTCAAGTTT GGTCATATGA ACCATCCGTG GCTGCTTTAG CTAATGGTGG TTTTGTGGTA
ACATGGATTT CGGAAACCTT GGAGACACTG TCCTCAGACA CCCATACCGA TATACATGGA
CAGCTATACA ACAGTGAGGG TGCAATGGTT GGATCGGAAT TTCAGGTTAA CACTTACACT
CAATACGGTC AATACACTCC TTCTATAACC GCTTTAGCTG ATGGTGGTTT TGTGGTAACA
TGGATTTCGG AAACCTTGGA GACACTGTCC TCAGACACCC ATACCGATAT ACATGGACAG
CTATACAACA GTGAGGGTGC AATGGTTGGA TCGGAATTTC AGGTTAACAC TTACACTCAA
TACGGTCAAT ACACTCCTTC TATAACCGCT TTAGCTGATG GTGGTTTCGT AATTATATGG
AGATGCGTAA ATAATGACGA CTATAACTGT AACTATATAC ATGGCCAGCG CTATAATGCT
GATGGGATAA TGGTTGGTTC AGAGTTTCAG GTAAACACCT ATACTCAAAT TGGGGCATAT
GAACCATCCG TGGCTGCTTT AGCTGATGGC GGTTTCGTGG TAACATGGGA ATCAGGAATA
GTAACAACAT GGAAGTCAGG ATATCAGGAT ACTTCCAATT CAGATATTTA CGGTCAAATA
TTCAATGTTG ATGGAGCAAT GGTTGGCTCG GAATTTCGGA TCAACACCTA TACAAAAGGT
TTTCAAGGCT GTCCTTCTGT GACCTCCCTT ACTAATTAA
 
Protein sequence
MNCPYYNRPP TSSLPFPTAV ETTHALSLQS IKKFPTLTLY NLHAFFHVNL LTIMATNSSS 
PLVRRELFIF DASVSNLSTL SSALSANSSY FVLDSTRDGL VQIADLLAGQ TDIDSLHIFS
HGSAGSLQLG NSSLSLVNLN NYELPLSVIG SSLSSSGDIL LYGCNVGAGD EGLAFVDKLA
KMTGADVAAS DDLTGATALG GDCELEVESG VIDEASFYYA PEYAGLLGAV GPEFHVDTSD
IQVWSYEPSV AALANGGFVV TWISETLETL SSDTHTDIHG QLYNSEGAMV GSEFQVNTYT
QYGQYTPSIT ALADGGFVVT WISETLETLS SDTHTDIHGQ LYNSEGAMVG SEFQVNTYTQ
YGQYTPSITA LADGGFVIIW RCVNNDDYNC NYIHGQRYNA DGIMVGSEFQ VNTYTQIGAY
EPSVAALADG GFVVTWESGI VTTWKSGYQD TSNSDIYGQI FNVDGAMVGS EFRINTYTKG
FQGCPSVTSL TN