Gene SAG2001 details

Gene Information

Locus tag: SAG2001
Symbol:
ID: 1014812
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus agalactiae 2603V/R
Kingdom: Bacteria
Replicon accession: NC_004116
Strand:
Start bp: 1980913
End bp: 1983183
Gene Length: 2271 bp
Protein Length: 756 aa
Translation table: 11
GC content: 34%
IMG OID: 637317168
Product: conjugal transfer protein, interruption-C
Protein accession: NP_688988
Protein GI: 22538137
COG category:
COG ID:
TIGRFAM ID:
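
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record, but both are consistent with the listed values. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1980913
end_bp = 1983183

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2271 756
```

Under these assumptions the result agrees with the listed gene length of 2271 bp and protein length of 756 aa.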


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.396509
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGGATT TTGAAGCGGA TCTAGCAGAT GATGTTAAAG AATTAGGGTT AGAAACACTC 
GATTTTACAG TTGATACGTT AACACATGAA ATGGAGATAC CGTATCAGTT TGATTGGTTA
ATTGGAGTTG ATCTTGGTAA AGGGCAATAC AATGCTAATA TAAAAGAATT TATTTATAAT
CAGTTTGAAT CTATCGCATC AAACTTCGCT TCTTTAGCTG GTTATGAGGT AGAAGTAGAT
GAAGATTGGT ATAAAGAACA TTCAGAAGAA GAATTACTTG TCTATAGTTT ATTGTCTACC
CTGAAAGCCA AAAGATTAAC GGATGTTGAT TTATTTTATT ATCAACGGAT GCAATTTTTA
AGATATGTTC CCCATACTAA AAGTGAGGTT ATAGCTAATC GGAATATGTT AAATGTGACA
GATACGTTGA TTAAATCACT AGAAGGTGGT TTTTTAAAAT TGGAGAGTGC TTATGGGAGT
TCATTTGTAT CCGTTCTTCC AGTAGGTCGG TTCTCTACCA TATTTAATGG CTTTCATCTT
GGTGAACTAG TTCAAAGAAT GAGTTTTCCG GTTGAATTAA GATTTAAAGC GGAATTTATA
GATAAGACAA AATTGGGTGG CACTATGGGT CGCTCAAATA CACGCTATGA TCAAATCATG
AAAGAAGCTT ATAACACAAA TACGGTTCAA CAAGATGATA TCTTGATGGG AGCATATTCG
CTTAAGGATC TTATGAAAAA AGTTGGAAAT AAGGAAGAAA TCATAGAGTA TGGTTGTTAT
TTGGTAGTTG CTGGTTCTAG TTTAAATCAA CTAAAACAAC GTAGATATGC CATTTTAAGT
TACTTTGATG ATATGAAAGT CAATGTTTAT GAAGCCAGTC ATGATACACC ATATTTATTT
CAAGCGTTAC TTTATGGTCA AGATTTACAA AAAACAACTC GTAAGTGGAA TCATTTAGTA
ACTGCTCGAG GTTTTTCGGA GTTGATGCTT TTTACAAATA CCCAATCAGG TAACAGGATT
GGTTGGTATA TTGGACGAGT AGATAATCGT TTAACTGCTT GGGATTCTAT TGACGAAGCT
ATTATGGGAT CTAAAAATCT TGTCTTATTT AATGCGACAG TCGCGAATAA AGAAGATGTT
GCTGGTAAGG TTACTAAAAA TCCTCACGTG ATTATAACTG GGGCGACAGG TCAAGGGAAA
TCATACTTAG CACAGATGAT CTTTTTACAT ACTGCTCAAC AAAATGTTCG TGTCCTTTAT
GTTGACCCTA AACGAGAACT ACGGCAACAC TATTTAAAAG TTGTTTCTGA TCCAGAGTAT
GCACGTAAGT TTCCTCTACG TAAGAAGCAA ATTGAGGAAA CGAATTTTGT CACTCTTGAT
AGTTCTGTCA AAGAAAATCA TGGTGTATTA GATCCTATTG TTATTCTTGA TAAGGAAGGT
GCTTCCTCAA CGGCTAAAAA TATGTTGCTT TATTTGTTAA AAAACGCTAC AGAGATTAAA
TTAGATCAAA CAACAGCTTT AACAGAAGCA ATTAGTCAAG TCATTGCAAA ACGTGAAGCT
GGTGAAGTCG TTGGTTTTAA TCAGGTTATT GAAGTTCTTA TTGATTCTGA AAGTGATGAA
GTTCAGTCAG TTGGTCGTTA TTTTAAAGCT ATTATTCAGA ATTCTATTTT AGAGTTGGCA
TTTTCAGATG GTGATGTTGC TGGTCTTTCT TACGAAGAAC GTGTGACAGT TTTAGAAGTT
GCGGATTTAT CATTGCCTAA GGACGGTTCA GATCATATAT CTGATCATGA AAGTAATTCG
ATTGCTTTAA TGTTTGCTTT AGGTGCGTTT TGTAAGCATT TTGGTGAGCG TAGTGATGAT
GAAACAGTTG AAATTTTTGA TGAAGCATGG GTACTTATGC AATCCTCAGA AGGTAAGGCT
GTTATCAAGT CTATGCGTCG TGTTGGTCGC TCTAAGTATA ATGTGTTGAT GCTAGTATCT
CAATCGGTTC ATGATGCAGA AAACGATGAT GATACGACCG GCTTTGGGAC AATTTTTTCA
TTTTATGAAA AATCGGAACG TGAAGATATT TTAAGTCATG TTGGTTTAGA AGTAACACCT
AAGAATCTAG AATGGATTGA TAATATGATT TCAGGTCAAT GTCTTTATTA TGATGTTTAC
GGTAACTTGA ATATGATTTC GATACATAAC ATTCATCCAG ATATTGATCC ATTGTTAAAA
CCTATGAAGA AAACAGTATC AAGTCACCTT GAAAATAAAT ATGCTTCGTA G
 
Protein sequence
MSDFEADLAD DVKELGLETL DFTVDTLTHE MEIPYQFDWL IGVDLGKGQY NANIKEFIYN 
QFESIASNFA SLAGYEVEVD EDWYKEHSEE ELLVYSLLST LKAKRLTDVD LFYYQRMQFL
RYVPHTKSEV IANRNMLNVT DTLIKSLEGG FLKLESAYGS SFVSVLPVGR FSTIFNGFHL
GELVQRMSFP VELRFKAEFI DKTKLGGTMG RSNTRYDQIM KEAYNTNTVQ QDDILMGAYS
LKDLMKKVGN KEEIIEYGCY LVVAGSSLNQ LKQRRYAILS YFDDMKVNVY EASHDTPYLF
QALLYGQDLQ KTTRKWNHLV TARGFSELML FTNTQSGNRI GWYIGRVDNR LTAWDSIDEA
IMGSKNLVLF NATVANKEDV AGKVTKNPHV IITGATGQGK SYLAQMIFLH TAQQNVRVLY
VDPKRELRQH YLKVVSDPEY ARKFPLRKKQ IEETNFVTLD SSVKENHGVL DPIVILDKEG
ASSTAKNMLL YLLKNATEIK LDQTTALTEA ISQVIAKREA GEVVGFNQVI EVLIDSESDE
VQSVGRYFKA IIQNSILELA FSDGDVAGLS YEERVTVLEV ADLSLPKDGS DHISDHESNS
IALMFALGAF CKHFGERSDD ETVEIFDEAW VLMQSSEGKA VIKSMRRVGR SKYNVLMLVS
QSVHDAENDD DTTGFGTIFS FYEKSEREDI LSHVGLEVTP KNLEWIDNMI SGQCLYYDVY
GNLNMISIHN IHPDIDPLLK PMKKTVSSHL ENKYAS
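
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the method used by the database: it builds the codon table from the compact amino acid string used for NCBI translation table 11, whose amino acid assignments coincide with the standard genetic code (table 11 differs in which start codons it permits). The helper name `translate` and the variable names are hypothetical. Only the first line of the gene sequence is translated here.

```python
# Codon table built from the compact NCBI layout: bases ordered T, C, A, G
# at each of the three codon positions; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring whitespace.

    Trailing bases that do not form a complete codon are dropped.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence, copied with its 10-base spacing.
first_line = "ATGTCGGATT TTGAAGCGGA TCTAGCAGAT GATGTTAAAG AATTAGGGTT AGAAACACTC"
print(translate(first_line))  # MSDFEADLADDVKELGLETL
```

The output matches the first 20 residues of the protein sequence. Applied to the final line of the gene sequence, the same function ends in `*`, reflecting the terminal TAG stop codon that is not part of the 756-residue protein.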