Gene Clim_1610 details


Gene Information

Locus tag: Clim_1610
Symbol:
ID: 6354832
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 1741568
End bp: 1742794
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 56%
IMG OID: 642669211
Product: pentapeptide repeat protein
Protein accession: YP_001943633
Protein GI: 189347104
COG category: [S] Function unknown
COG ID: [COG1357] Uncharacterized low-complexity proteins
TIGRFAM ID:
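
The length fields above are consistent with 1-based, inclusive coordinates: 1742794 - 1741568 + 1 = 1227 bp, and 1227 / 3 = 409 codons, of which one is the stop codon, leaving 408 aa. A minimal Python sketch of that check follows. The function names are illustrative and not part of the source database, and the inclusive-coordinate convention is an assumption inferred from the numbers.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(1741568, 1742794)
print(length_bp, protein_length(length_bp))
```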


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000228674
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAAATA ATATCCGTTC GATTCTCCTT TTCTGCGCTC TTGTGCCGGC ATTGCCTGTC 
TCTGCCTCGG CCTTCAATAC CGCAGATTTC AATGCGCTGA AAACCGGCGT GAAACCATGG
AACAGTTACA GGGCCGGGCT CGGCGGACGT GTCGCGGATC TTTCCGGGGC GCAGCTTAAG
GGCATGAACC TGAGAGGGGC GGACTTGAGC TACGCCGATC TTTCCGGTGC CGATCTCGCC
AGTTCCGATC TCAGTAAAGC CAGGCTCGAT CATGCCCGAC TCGACTCGGC CGTACTCCGT
TCGGCTTTGC TGGTCAGGGC TTCGCTCGAT AAAGCGCGGC TCCATAATGC CGATCTTGAA
GACGCAGTGC TTGAAGCCGC TTCGTTCAAA GGAGCCTTTA TGCAGACCGC GGTACTCAAG
AAGGCAGACT GCACCGGCGC CGATTTCAGC GGTGCCGATC TCCGGGAAAC GAATTTTCGT
GAAGCGAGGC TTGCCGGTGC ACTGCTAACC GGTGCTGATC TGCGAGCGAC CTACCTCTGG
CGGGCCGACA TGAGCAGGTC GGTATTGAGT GGTTCCAGGG TTTCGCCGTC CACGGTACTG
GCTTCAGGCA GCTATGCTTC GCAGGAGTGG GCTTCGGAGC ATCGCGCGGA GTTCCTCAAC
GATGCACCTG AGCCCGTTCA GGTTGTCGCA GGAATGGCTC GTACCGGTCA GCCTCTTCCA
ACGGTAACGG GAAACGTCGG AAACTCTGAG TCGGCAAAAA GCGGCTCGAA GGGAAACCTG
TGGCGCAACT CCGGTTCCGG CGCAGCTGTC GTTTATGATC AGGTGCTCTA TAAAAAGCTG
AAATCCGGGG TATTCGCATG GAACGATATG CGCAAGCGCA ACCGGGCCAT GGAAGTCAAT
CTCCGTCAAG CGAAATTCGA TCAGAAAAAT CTCAGTTATG CCGATCTTGC CCATGCCAGG
CTGCAGGGAG CAAGTTTCAG GAAGGCCGAT CTTTTCGATG CCGACCTTCG GAACGCCGAT
CTTTCGGGAT GCGATATGCG CGAAGCGAAT CTTGAAAAGG CCGATCTGGG AGGAGCCGAT
CTTTCCGGTG TGAATCTCTG GCGGGCGAAT CTCGGCCGCG CGCGTCTTAA CGGCGTTAAG
GTTTCCGCCT CTACTGTTCT CGATACCGGC AAAAAGGCTG ATCAGAAGTG GGCTGAACGG
CATGATGCCG TATTTATTCA TGAGTAA
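
As an illustration of how a GC content figure such as the one listed under Gene Information can be computed from the gene sequence, a minimal Python sketch follows. The function name `gc_content` is hypothetical, and how the database rounds its reported percentage is not stated in this record.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases; whitespace between 10-base blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage: paste the full gene sequence (spaces and line breaks are fine).
first_line = "ATGCAAAATA ATATCCGTTC GATTCTCCTT TTCTGCGCTC TTGTGCCGGC ATTGCCTGTC"
print(f"{gc_content(first_line):.1f}% GC in the first 60 bases")
```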
 
Protein sequence
MQNNIRSILL FCALVPALPV SASAFNTADF NALKTGVKPW NSYRAGLGGR VADLSGAQLK 
GMNLRGADLS YADLSGADLA SSDLSKARLD HARLDSAVLR SALLVRASLD KARLHNADLE
DAVLEAASFK GAFMQTAVLK KADCTGADFS GADLRETNFR EARLAGALLT GADLRATYLW
RADMSRSVLS GSRVSPSTVL ASGSYASQEW ASEHRAEFLN DAPEPVQVVA GMARTGQPLP
TVTGNVGNSE SAKSGSKGNL WRNSGSGAAV VYDQVLYKKL KSGVFAWNDM RKRNRAMEVN
LRQAKFDQKN LSYADLAHAR LQGASFRKAD LFDADLRNAD LSGCDMREAN LEKADLGGAD
LSGVNLWRAN LGRARLNGVK VSASTVLDTG KKADQKWAER HDAVFIHE
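
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table from the NCBI compact notation (bases in T, C, A, G order). Translation table 11 uses the same amino acid assignments as the standard code and differs only in its permitted start codons, which does not matter here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 and last 27 bases of the gene sequence listed above.
print(translate("ATGCAAAATA ATATCCGTTC GATTCTCCTT"))
print(translate("CATGATGCCG TATTTATTCA TGAGTAA"))
```

Passing the full gene sequence to `translate` should give the full protein sequence for comparison against the listing above.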