Gene Cag_1107 details

Gene Information

Locus tag: Cag_1107
Symbol:
ID: 3748325
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 1493593
End bp: 1495860
Gene Length: 2268 bp
Protein Length: 755 aa
Translation table: 11
GC content: 47%
IMG OID: 637773638
Product: ComEC/Rec2-related protein
Protein accession: YP_379412
Protein GI: 78189074
COG category: [R] General function prediction only
COG ID: [COG0658] Predicted membrane metal-binding protein
TIGRFAM ID: [TIGR00360] ComEC/Rec2-related protein
            [TIGR00361] DNA internalization-related competence protein ComEC/Rec2
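
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop (assumed)."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


# Values taken from the record above.
length = gene_length_bp(1493593, 1495860)
print(length, protein_length_aa(length))  # 2268 755
```

Under these assumptions the computed values agree with the listed gene length (2268 bp) and protein length (755 aa).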


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAGAGC CATCAAAGCC CCATAGTGCC GTAAAACCAA AGCGAGCAAT AGGTTTATCG 
TTGGCACCTT ATCCCGCTGT GCGGTTGCTT TTTTTTGTCA TTATAGGAAT TGTTGTAGGG
GTTGTTGCAC CTTTTTCGCT CACGGAGTGG CTATGGAGCG TAGCGTTATC ATTTGCGCTG
TTGTTGCTTA CATGGCTTTA TGAGCGAATT CGTTATCACC AAGCGGCTGT ACCGCATTTT
GGCATGGCGA TAATGTATTG TTTCGTTGTG GTTTCGGTGT TTGCAACGCT CAGTGCTTAC
CGATTGCACT ATGCACCACG TAACGGTTTA ACGCAATATG CTGGGCGCAC CGTGATTCTT
TATGGCTCCA TTGAAAGCCG TCCCGAACGT TCTAAAGGTG GTGCAAGTTG GGTTATGGAG
GTGCAAGAGC TTTTTGAACA TGGCAAAACC GTTACGCTTC GCGACCGCAC GAAAGTATTT
ATGCGCATGA GTGCCGATGC TCATCTTGCT GTTCAAAAAG GCGATATGGT TCGCGTAAAA
GGCAAGCTTG ATTTGTTGCC CGAAGCTGCT AATGCAGGGG AGTTTAATCC TCGCCATTAC
GGGGCTATGC AACAAATCTC TGTGCAGCTT TATGCTGCCG GTCCATGGCA GGTGCTCTAT
GAGGGTGAAA AGCGCTTACA TCCATTCGAG CAATATATGG TGCAACCAAC CTATCGCTAC
ATTATGCAGG CGCTTGCCGC ACTTCTGCCC GATGGCGAGG AGCGTAAGTT AGCCGCAGGC
GTGCTTACGG GTGAGCGCGA AACCATGTCG GAAGAGGTGT TTGAAGCTTT TAAGCGCACA
GGTACAGCAC ATATTTTAGC CGTCTCAGGC ATGAATGTTG GCTTGTTGGC ATTAATTATT
CAAGTATTTT TGCAACGGCT TAAAATTACG CCATTTGGGC GTTGGACGGC GTTTTTGCTC
TTTGTTTTTT TGTTGATACT CTATAGCAAT GTTACGGGGA ATTCAGCCTC AGTAACGCGT
GCGGCATTTA TGGCGTTAGT ATTGATTGCT GGAGAAACTG TTGGGCAAAA AACATATCCG
CTGAATTCCT TAGCCGTTGC TGACTTGATT ATTTTGCTTA TTAATCCGCT TGATCTCCTG
AACCCAGGTT TTTTAATGAC CAATGGCGCC GTTTTAGCGC TTTTTCTTGT TTATCCGCTT
CTCCATTTTC CACGCCCTAA AAATCGAACC CTTCTTTTAT CAATAGTGTG GTTTCTGCTT
GATAGCATCA TTATTACCCT TGCCGCAAGC ATTGGCGTTT CACCCGTAAT TGCCTACTAC
TTTGGCACCT TTTCGCTTAT TAGCTTTGTT GCAAACATTC CTGTTGTCTT TTTTTCTACC
TTGCTGATGT ATGCCTTAGT GCCAATGTTG GTTGTGTATG GGTTGTCGCA AGCTCTTGCC
TCCGTTTTTG CCGCAGGCGC TTTTTGGCTT GCCCGCATGA CATTGCAATC GGCATTGTGG
TTTAGCAATT TCTCTTTTGC TTCAATTCCC TTAAAACTTG ATGCGGTAGA GGTGTGGCTT
TATTACATAG TGCTTGCAGC GGTGCTCTTG CTGGCAACAC GTAAAGCGTG GAGCCGTGTT
GCCATAACGT TTTTGTTGGG AGTAAATCTT TTTGTGTGGT ACTCCTTGCT TTTTCGTCCA
AATCCAATAG CACCAACACT GCTTACTGTT AACTTAGGTC GGAATCTTGC AACAATTGTT
TCCAATGGCA GCGAAAGTGT GCTTATTGAT GTTGGCAAGA AACCCAAAGA TTATCAACGC
ATTAGCGCCC AATTTGAGCG TTTTGGCATT GTAGAGCCCA CTGCCGTTGT ACAATTTTAT
TCACCCGACT CGTTGATTTT GGCAACGCCA ACTCGCCACC ATTTTTTACG GAGCGATAGT
CTGCTGCGGC TTTCTTCCAT GGTTATTACG CGACCCGACG AAAAAATGGT AAAGCTTTGG
AGCCGCAACC AAAGCTATTT TCTTGCTTCA GGCACCAGCC GTTTAAAAGC GGGGGAGCCC
TATTGTGGCG ATGTGGCTTG TATTTGGATT TATCGCTTTG GCGAAAAGCA GCGCATTGAG
TTGGAACGGT GGCTTACCGC AACAAAGCCA AAAGAGGCTC TTTTGGTGCC AAGCTCATTT
TTATCGCGCG TGCAGCTTGT GGCATTGCAC CGTTTTGCAG CGGCTTATCC TCATGTTGAG
GTGCGTAGTA AAACTAAGCA GGTGGTGGTG AATGGGGGAG AGAGGTAA
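
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. As a rough sketch (the helper names are hypothetical), the snippet below joins the blocks and computes the G+C fraction. Only the first line of the sequence is used here for illustration; running it on the full sequence should give a value to compare with the listed GC content.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence above, for illustration only.
first_line = "ATGACAGAGC CATCAAAGCC CCATAGTGCC GTAAAACCAA AGCGAGCAAT AGGTTTATCG"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.1%}")
```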
 
Protein sequence
MTEPSKPHSA VKPKRAIGLS LAPYPAVRLL FFVIIGIVVG VVAPFSLTEW LWSVALSFAL 
LLLTWLYERI RYHQAAVPHF GMAIMYCFVV VSVFATLSAY RLHYAPRNGL TQYAGRTVIL
YGSIESRPER SKGGASWVME VQELFEHGKT VTLRDRTKVF MRMSADAHLA VQKGDMVRVK
GKLDLLPEAA NAGEFNPRHY GAMQQISVQL YAAGPWQVLY EGEKRLHPFE QYMVQPTYRY
IMQALAALLP DGEERKLAAG VLTGERETMS EEVFEAFKRT GTAHILAVSG MNVGLLALII
QVFLQRLKIT PFGRWTAFLL FVFLLILYSN VTGNSASVTR AAFMALVLIA GETVGQKTYP
LNSLAVADLI ILLINPLDLL NPGFLMTNGA VLALFLVYPL LHFPRPKNRT LLLSIVWFLL
DSIIITLAAS IGVSPVIAYY FGTFSLISFV ANIPVVFFST LLMYALVPML VVYGLSQALA
SVFAAGAFWL ARMTLQSALW FSNFSFASIP LKLDAVEVWL YYIVLAAVLL LATRKAWSRV
AITFLLGVNL FVWYSLLFRP NPIAPTLLTV NLGRNLATIV SNGSESVLID VGKKPKDYQR
ISAQFERFGI VEPTAVVQFY SPDSLILATP TRHHFLRSDS LLRLSSMVIT RPDEKMVKLW
SRNQSYFLAS GTSRLKAGEP YCGDVACIWI YRFGEKQRIE LERWLTATKP KEALLVPSSF
LSRVQLVALH RFAAAYPHVE VRSKTKQVVV NGGER
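
The relationship between the two sequences can be spot-checked by translating the gene sequence. This is a minimal sketch, not the database's own pipeline: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, and it does not handle the alternative start codons of table 11 (not needed here, since the gene starts with ATG). The function name `translate_cds` is hypothetical.

```python
from itertools import product

# Codon-to-amino-acid mapping in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(seq: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 18 bases of the gene sequence above.
print(translate_cds("ATGACAGAGC CATCAAAGCC CCATAGTGCC"))  # MTEPSKPHSA
print(translate_cds("AATGGGGGAG AGAGGTAA"))               # NGGER
```

Both fragments match the corresponding ends of the listed protein sequence (MTEPSKPHSA at the start, NGGER at the end).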