Gene Cag_1404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1404 
Symbol 
ID3747163 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1869567 
End bp1870568 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content49% 
IMG OID637773940 
Producthypothetical protein 
Protein accessionYP_379705 
Protein GI78189367 
COG category[S] Function unknown 
COG ID[COG1774] Uncharacterized homolog of PSP1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAACG TTGTTTGCAG TTGTTTGCTT GGCGATAATG CCCATATTCG CTTACGCCTA 
TCACAATTGT TGCATAGCGA GGTGGAAGCG CTTTATAGCA ATGAGCCGCC GCTTGGCGAG
GGGCAGCCTG AAATTTGTGA AGTTGAGTTG CAGGGGTGCC GTCGCGACTT TTTTGTGAAT
GCTTCTGGGG TGCCTTGTGT GGTGGGGCAG CAAGTTGTGG TGGAGTCGGA TGGTGGGTAT
GATTACGGTC TGGTTTATTC CACAGGCGCT ATTGCCCGTA AAAAGCTTCA GTTAAAGGGG
CTTGATAAGC AGGGTATTGA GTGGAGTTCG GTGGTGCGCG TTGCCGATGA GCACGATGCG
CGTGCTATTG AGGAGTTGCA GCGTCGGCAG GCTGAAATTC GTGAAGTGTG CCTTGCTAAA
ATTAAAAGGC ACGAGCTTGA TTTAAAGTTG GTGGATGTTG AGTTGCGCAT GGATCAGCAA
AAGCTGTCGG TTTATTATAC GTCGTCGCAC CGTGTGGATT TTAGAATGTT GGTGCGCGAT
TTAGCGGGCG AGTTTAAGGC GCGTATTCAA ATGGTGCAAA TTACCACGCG CGAAGAGGCG
CGTCGGGCGA ATGCGTTTGG TCCGTGTGGC AATTTGCTCT GTTGCTCCAG TTGGATTCAA
AAAATTCAAG CCAATCCCTT TGCCGATAAA ACGCACTATT CCGAAAACCC CTCTAATAAC
GATTCCCACA CCTTTAACAT GACGGGACTT TGCAATCGCC CAAAGTGTTG CATAGGTTTT
ACTACTCGTC AAGATAAAAA TGGTGGGCGC ATTGGTGGTT CGTGCTGTTC GCAGCAGCAG
CCATTGCCAA CGGTTGGCAC GCTTCTTTCA ACCCCCGATG GGCAGGCGCA AATTGCTTTT
GTTGATGCTC AAAAAAAGCT TGTGGTTATT CGTTACCAGC ATAACAACCA AACGCGCCGC
TTTCCTCTCG ATAAGTTTAA CGCTCTTTTT ACCCGTCAAT AA
 
Protein sequence
MSNVVCSCLL GDNAHIRLRL SQLLHSEVEA LYSNEPPLGE GQPEICEVEL QGCRRDFFVN 
ASGVPCVVGQ QVVVESDGGY DYGLVYSTGA IARKKLQLKG LDKQGIEWSS VVRVADEHDA
RAIEELQRRQ AEIREVCLAK IKRHELDLKL VDVELRMDQQ KLSVYYTSSH RVDFRMLVRD
LAGEFKARIQ MVQITTREEA RRANAFGPCG NLLCCSSWIQ KIQANPFADK THYSENPSNN
DSHTFNMTGL CNRPKCCIGF TTRQDKNGGR IGGSCCSQQQ PLPTVGTLLS TPDGQAQIAF
VDAQKKLVVI RYQHNNQTRR FPLDKFNALF TRQ