Gene Cag_1094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1094 
Symbol 
ID3747961 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1480443 
End bp1481729 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content48% 
IMG OID637773625 
Producthypothetical protein 
Protein accessionYP_379399 
Protein GI78189061 
COG category[S] Function unknown 
COG ID[COG1322] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.437038 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCGAAA CGCTCTCTTT TTTCCTGCTC CTTGCTCTTT TTGGGCTTCT CATATTTCTT 
GCTATTCGTT TGCTTAGCGC CGCTCCTAAG CAGGCTGAGT TGGCGCGTTT GCAGGCTCTT
GAAGCCGATG CTCTTCGCAA TATGGAGCGC TTAACTGCCC TTGCTGAGGA GCATGAAGCG
TTAAAAATTC GTTATGCACG TCTTGAGGTG GCTTACCAAA ATGAGCAACA ATCAGTGGCT
GAAAAAGAGG CGTTGATGCT TGCTTCTGAA GAGCGGTTAA AAAAGGAGTT TGAGTTGCTT
TCGCTGCGCA TTTTGGAAGA GCGTGGAAAA GCGCTTGGGG CTGAGCAGCG TGAACGGCTT
GATACTCTTT TGCTGCCGTT ACGCCAGCAG CTTGAAGCCT ATCGCCAACG CATTGAAGAG
GTGCATCATG CGGACACGTT GCTTTCGGGG CAGCTTATTG AAGAGGTGCG GCAGTTGCAG
GCATTAAGTA GTCGCGTGAG TAACGATGCC CAACAGCTTG CCCATGCCAT TAAAGGTGAT
TCAAAAGTGC AGGGTAATTG GGGGGAAATA ATTATTGAGC GAATGTTTGA AGCTTCGGGG
CTTGAAAAAG GGCGCGAATA TCTTGCGCAA GAGAGTTTTC GTGATAGCGA TGGTGCCCTT
AAGCGTCCCG ATTTTATGGT GCTTTTGCCC GATAACAAAG CTATTATTGT GGATTCAAAA
GTCTCTCTTA CTGCTTTTGA GCGTTATAGT GCGCTAAGCG ATCCCGATGA GCAGCAAATT
GCGTTGCGTG AGCATCTGCA ATCGGTGCGG CGCCACATTA CCGAGTTGCA AGCAAAAAAC
TATCATGAAT TGGGAGGCAA CCGCACGCTT GATTTTGTGT TGCTCTGCAT TCCCATAGAA
GCAGCATGGC AAGCGGCGAT GCAAGCCGAT CCCGCATTAC TTTATACGCT TGCAGGGCGT
AACGTGGTGG TTTGTAGCCC TACAACGCTG ATGATGACCC TCAAGCTTAT TGCTCAATTG
TGGCGACGTG AGCACGAAAA CCGCAATGCT GAACTTATTG CCGAAAAGGC AGGGCGCATT
TACGATCAAG TGGCTTTGCT TGCTCATAGT ATGTTGGAGG CACAAAAAAA GTTAAGCAAT
GTTAATGATT CGTTTGAACA GGTGTTAAAG CAGCTTAAAA CAGGGCGTGG AAATTTAATT
GGGCGCGTGG AGGAGATTCG TAAGCTTGGG GCTAAAGTGA ATCGCCAAAT GCCGCTTGAT
GTTACAGCAG AGGCGTTAGA GGAGTAA
 
Protein sequence
MLETLSFFLL LALFGLLIFL AIRLLSAAPK QAELARLQAL EADALRNMER LTALAEEHEA 
LKIRYARLEV AYQNEQQSVA EKEALMLASE ERLKKEFELL SLRILEERGK ALGAEQRERL
DTLLLPLRQQ LEAYRQRIEE VHHADTLLSG QLIEEVRQLQ ALSSRVSNDA QQLAHAIKGD
SKVQGNWGEI IIERMFEASG LEKGREYLAQ ESFRDSDGAL KRPDFMVLLP DNKAIIVDSK
VSLTAFERYS ALSDPDEQQI ALREHLQSVR RHITELQAKN YHELGGNRTL DFVLLCIPIE
AAWQAAMQAD PALLYTLAGR NVVVCSPTTL MMTLKLIAQL WRREHENRNA ELIAEKAGRI
YDQVALLAHS MLEAQKKLSN VNDSFEQVLK QLKTGRGNLI GRVEEIRKLG AKVNRQMPLD
VTAEALEE