Gene SAG1235 details


Gene Information

Locus tag: SAG1235
Symbol:
ID: 1014042
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus agalactiae 2603V/R
Kingdom: Bacteria
Replicon accession: NC_004116
Strand:
Start bp: 1244650
End bp: 1245927
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 44%
IMG OID: 637316416
Product: GBSi1, group II intron, maturase
Protein accession: NP_688240
Protein GI: 22537389
COG category: [L] Replication, recombination and repair
COG ID: [COG3344] Retron-type reverse transcriptase
TIGRFAM ID:
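
The Start bp, End bp, Gene Length, and Protein Length fields above are mutually consistent, which the short sketch below checks. It assumes that the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; neither assumption is stated in the record itself, and the function names are illustrative only.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for SAG1235.
length_bp = gene_length(1244650, 1245927)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1278 425
```

Both results match the Gene Length (1278 bp) and Protein Length (425 aa) fields reported in the record.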


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00153428
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAGAAT TGCTAGATAA GATATTATCT CGGAACAATA TGCTCGAAGC TTACAAGCAA 
GTGAAATCAA ACAAAGGTTC TGCTGGTATC AATGGGGTCA CTATCGAGCA GATGGATGAC
TATCTTCACC AAAATTGGCG AGAAACCAAG CAACTCATCA AAGAGAGGAG CTATAAACCT
CAACCGGTTC TCAGGGTTGA AATCCCAAAA CCAAACGGAG GAGTTCGTAA CCTAGGTATC
CCGACGGCTA TGGATAGAAT GATTCAGCAG GCCATCGTTC AAGTTTTGAG TCCACTCTGC
GAAAAACATT TTTCAGAGTA TAGCTATGGG TTCAGACCCA ATCGCTCCTG CGAAACAGCC
ATTGTTCAGC TACTTGAGTA TTTAAACGAT GGCTACGAGT GGATTGTGGA CATTGACTTG
GAAAAGTTCT TCGATACTGT TCCGCAAGAC AGATTGATGT CCCTGGTTCA TAATATCATT
CAAGATGGCG ATACGGAGTC ACTGATTCGT AAGTACCTCC ATTCGGGAGT TGTTATTAAC
GGACAGCGAC ATAAGACTTT AGTCGGGACA CCTCAAGGCG GGAATCTATC ACCCCTCCTA
TCTAATATTA TGCTTAATGA GTTAGACAAA GGGTTGGAAA AGCGAGGTCT TCGCTTTGTC
CGTTACGCCG ATGACTGTGT CATCACTGTC GGAAGCGAAG CAGCTGCTAA GCGGGTCATG
CATTCGGTCA GTAGCTATAT TGAGAAGCGA TTAGGGTTGA AAGTCAACAT GACTAAGACC
AAGATTGTCA GACCGAACAA ACTCAAATAC CTCGGATTTG GTTTCTGGAA ATCTCCAAAA
GGTTGGAAGT GTCGTCCTCA CCAAGACAGC GTTCAGAGCT TTAAGCGAAA ACTGAAGCAA
CTGACGATGA GGAAATGGAG CATTGACCTG ATAACTCGCA TTGAACGATT GAACTGGGTC
ATTCGAGGAT GGATAAACTA TTTCTCGCTT GGCAATATGA AGAGTATCAT GACACAAATA
GATGAGCGTC TGCGAACCCG TATTCGAGTG ATTATCTGGA AGCAATGGAA GAAGAAAGCA
AAGCGCCTAT GGGGACTCTT AAAACTAGGA GTTGCTAGAT GGATAGCCGA TAAAGTTTCT
GGATGGGGTG ACCACTATCA GTTGGTAGCT CAGAAGTCGG TACTCAAACG TGCTATATCA
AAACCAGCCC TCGCAAAGCG AGGACTGGTC AGTTGCTTAG ATTACTATCT TGAACGACAT
GCGTTAAAAG TTAGTTGA
 
Protein sequence
MSELLDKILS RNNMLEAYKQ VKSNKGSAGI NGVTIEQMDD YLHQNWRETK QLIKERSYKP 
QPVLRVEIPK PNGGVRNLGI PTAMDRMIQQ AIVQVLSPLC EKHFSEYSYG FRPNRSCETA
IVQLLEYLND GYEWIVDIDL EKFFDTVPQD RLMSLVHNII QDGDTESLIR KYLHSGVVIN
GQRHKTLVGT PQGGNLSPLL SNIMLNELDK GLEKRGLRFV RYADDCVITV GSEAAAKRVM
HSVSSYIEKR LGLKVNMTKT KIVRPNKLKY LGFGFWKSPK GWKCRPHQDS VQSFKRKLKQ
LTMRKWSIDL ITRIERLNWV IRGWINYFSL GNMKSIMTQI DERLRTRIRV IIWKQWKKKA
KRLWGLLKLG VARWIADKVS GWGDHYQLVA QKSVLKRAIS KPALAKRGLV SCLDYYLERH
ALKVS
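
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch, not part of the original record, of how such a block could be cleaned, its GC fraction computed, and its translation compared with the listed protein. The helper names are hypothetical. The codon table is built from the amino-acid string in NCBI's TCAG codon order; translation table 11 (given in the record) uses the same codon-to-amino-acid assignments as the standard table and differs only in the permitted start codons.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI's TCAG codon order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_block):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq_block.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in an already cleaned sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence shown above.
first_blocks = "ATGTCAGAAT TGCTAGATAA GATATTATCT"
print(translate(clean(first_blocks)))  # MSELLDKILS
```

The printed residues match the first block of the protein sequence. Applying `clean` to the full gene sequence and passing the result to `gc_content` and `translate` would allow the same comparison against the GC content and protein sequence fields of the record.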