Gene Cag_1414 details

Gene Information

Locus tag: Cag_1414
Symbol:
ID: 3747173
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 1882972
End bp: 1884003
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 47%
IMG OID: 637773950
Product: hypothetical protein
Protein accession: YP_379715
Protein GI: 78189377
COG category: [C] Energy production and conversion; [G] Carbohydrate transport and metabolism
COG ID: [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase
TIGRFAM ID: [TIGR00661] conserved hypothetical protein
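
The coordinate and length fields in this record are related by simple arithmetic. As a minimal sketch in Python (the function names are hypothetical, and 1-based inclusive coordinates are assumed, which is consistent with the listed values), the gene length follows from the start and end positions, and the protein length from the gene length minus one stop codon:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1882972, 1884003)
print(length, protein_length(length))  # 1032 343
```

Both results agree with the Gene Length (1032 bp) and Protein Length (343 aa) fields.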


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGAAAATTC TTTTTGGTGT TCAAGGAACC GGTAACGGTC ATATTAGTCG CAGTCGCGAG 
CTTGTTCGTG CGCTTAAAGA AGCTGGGCAC GAGCTTGAAG TAATTATTAG CGGACGCAAA
GAAGAGGAGC TTAAAGAGAT TGAGATTTTT AAGCCCTATC GCGTTCTAAA AGGCATGACG
TTGGTAACGC AAAAAGGGCG CTTGAACTAT GTTGACACCA TGGTGCAACT CGATTTTGTG
CGCCTTGTTG CCGACATTGT AACGCTTGAT ACCGAAGGGG TTGATCTTAT TGTAACCGAC
TTTGAGCCAA TCACCTCACT AACGGCAAAG TTGAAAAATA TTCCCTCAGT AGGCTTTGGG
CATCAATACG CCTTCCGTTA CGATATTCCC GTAGCGCCAG GCTCTTTTTT TGAGAAATAT
GCTTTGTTGA ACTTTGCTCC AGCCCACTAC AACGCTGGTT TGCATTGGCA CCATTTCTCC
CAACCCATTT TTCCCCCAGT TATTCCCGAA ACCCTTTACG CAAAGCATCA TGTTGCCGTT
ATTAGTAACA AAGTGCTGGT TTACTTGCCT TTTGAAGAGG TGGAGGATAT CACCACCTTT
TTAACGCCCT TTACCGATTT TGAATTTTTT ATTTATGGTA AAGTGCAAGA GGGGAGCGAC
CATGAGCATT TGCACTACCG CACCTACTCG CGCGAAGGTT TTCTTGCGGA TTTAATGGAA
TGCACGGGCG TGGTATGTAA TGCGGGCTTT GAGCTACCGG GTGAAGCGTT GCACCTTGGC
AAAAAAATGC TGTTGCGTCC GCTTGACGGG CAAATTGAGC AGCAATCAAA CGCGCTGGGA
ATGGTGGAAC TTGGCTACGG CATGGCAATG GAGAGCCTTG ACCCCACAAT TTTAGCCGAT
TGGTTGCAGC AACCTTGTCG TGAACCGTTA CGCTACGCAC GCACCGTCAA CTACATTGCC
GAATGGATAA GTTACCGCCA TTGGGATGAG TTGGGGAAAT ACACGGCTAA GGCGTGGGTA
GATCACGCAT AA
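
The gene sequence is displayed in blocks of 10 bases, so whitespace has to be removed before any computation. The following is a minimal sketch, assuming GC content is the share of G and C bases over the total sequence length; the page does not state how its 47% figure was computed or rounded. The helpers `clean_sequence` and `gc_percent` are hypothetical names:

```python
def clean_sequence(blocked: str) -> str:
    """Strip spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_percent(blocked: str) -> float:
    """Percentage of G and C bases in the cleaned sequence (assumed definition)."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Toy input for illustration only, not the gene sequence itself.
print(gc_percent("ATGC ATGC"))  # 50.0
```

Passing the full block-formatted gene sequence to `gc_percent` would give a value to compare against the GC content field.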
 
Protein sequence
MKILFGVQGT GNGHISRSRE LVRALKEAGH ELEVIISGRK EEELKEIEIF KPYRVLKGMT 
LVTQKGRLNY VDTMVQLDFV RLVADIVTLD TEGVDLIVTD FEPITSLTAK LKNIPSVGFG
HQYAFRYDIP VAPGSFFEKY ALLNFAPAHY NAGLHWHHFS QPIFPPVIPE TLYAKHHVAV
ISNKVLVYLP FEEVEDITTF LTPFTDFEFF IYGKVQEGSD HEHLHYRTYS REGFLADLME
CTGVVCNAGF ELPGEALHLG KKMLLRPLDG QIEQQSNALG MVELGYGMAM ESLDPTILAD
WLQQPCREPL RYARTVNYIA EWISYRHWDE LGKYTAKAWV DHA
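
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below is an illustration, not the database's own pipeline. It builds the codon table from the standard NCBI amino-acid string (tables 1 and 11 share the same codon assignments and differ only in the permitted start codons) and translates the first 60 bases of the gene sequence. The names `CODON_TABLE` and `translate` are hypothetical:

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first base varies slowest, third base fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(blocked.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from this record (60 bases).
first_line = "ATGAAAATTC TTTTTGGTGT TCAAGGAACC GGTAACGGTC ATATTAGTCG CAGTCGCGAG"
print(translate(first_line))  # MKILFGVQGTGNGHISRSRE
```

The output matches the first 20 residues of the protein sequence listed above (MKILFGVQGT GNGHISRSRE).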