Gene Francci3_1269 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1269 
Symbol 
ID3906115 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1515171 
End bp1516535 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content69% 
IMG OID637878603 
ProductCBS 
Protein accessionYP_480376 
Protein GI86739976 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.183003 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTCCG GTGATCTTTT TCTCGTCTTC ATCGCCGTGA TGGGCTCGCT TGCCGCGGCG 
GGTCTCGGCG GCATCGACGC GGCCCTGACC CGGGTCTCCC GGGTGACGGT CGAGGGTTTC
TCCCGCCAGG GCCGGGCCGG CGCCCGGAAC CTGGCGACCG TCGTGGCCGA TCCGGGCCGC
TACCTCGCGC TGCTGCTGCT CCTGCGCATC GTCGCGGAGA TGCTCGCCGC GGCCTGTATC
ACGGTGCTGT TCGTGCACGC CTACGGGGCC GGGTTCGCCG CGATCGGCCT CGGGACGCTG
GCCTCGACCC TGGTCGCGTA CATCCTCGTC GGGGTGATGT TCCGCACCCT GGGCCGCCAG
CACGCCCCTG CGGTGGCGTT GGCCAGCGCC GGTCTCACGG TCCGGCTGGC GCGGATCTTC
GGGCCGTTGC CCCGGCTGCT GATCGCGTTC GGCAACGCGG TCACCCCCGG TCCTGGTTTC
CGGGATGGTC CCTTCGCATC CGAGGCCGAG CTGCGCGATC TCGTCGACCT CGCCGAGGAG
AACGAGGTCA TCGAGCGTGA GGAACGCGAC ATGATCGCGT CGGTGTTCGA GCTCGGGGAC
ACCCTGGTGC GCGAGGTGAT GGTGCCGCGT CCGGACATGG TCTTCATCGA GTCGACGAAG
ACCGTCCGGC AGGCTCTCGC GCTGGCCCTG CGCAGCGGCT TCTCGCGGAT CCCCGTCATC
GGGGAGAGCG TCGACGACGT GGTGGGGATC GCCTTCCTCA AGGACATGGT CCGCCGGGAA
CGGGAGGGGG GCGAGGATGG TGCGATCGCC GAGATCATGC GTTCACCCGC ACTCGTGCCC
GAGAGCAAGC CCGCGGACGA CCTGCTCCGT GAGATGCAGG CGTCGCGTAC CCACATGGCG
ATCGTCATCG ACGAGTACGG TGGAACCGCC GGACTCGTCA CCATCGAGGA CATCCTTGAG
GAGATCGTCG GTGAGATCAC CGACGAGTAC GACAACGAGG TTCCGCCGGT GGAGTGGATC
GACGCCAACA CCGCGCGGGT GACCGCCCGG CTCGACGTCG ACGATCTGGC GAAGCTGTTT
GACTTCGACG TGGACGACCT GCCCGGTGCG GATGACAGCC TCACCGTCGG TGGCCTGCTG
GCCACGGCCC TGGGACGGGT GCCCATCCCG GGTGCGACGG TCACGGTCGG TGGGTTGCGG
CTGTCGGCGG AACGGGCCGC CGGCCGGCGT AACCAGATTG GCACCGTGGT CGTCCAGCGG
TTGCCGCATC CGTCGAACGG CGACGACGGC TCGCCGCCTC CCCGATCCGA CGACGGCGGC
GAACCCGTGC GCAGCTCGTC GTCGGACAGA AAGGTGAATT CGTGA
 
Protein sequence
MSSGDLFLVF IAVMGSLAAA GLGGIDAALT RVSRVTVEGF SRQGRAGARN LATVVADPGR 
YLALLLLLRI VAEMLAAACI TVLFVHAYGA GFAAIGLGTL ASTLVAYILV GVMFRTLGRQ
HAPAVALASA GLTVRLARIF GPLPRLLIAF GNAVTPGPGF RDGPFASEAE LRDLVDLAEE
NEVIEREERD MIASVFELGD TLVREVMVPR PDMVFIESTK TVRQALALAL RSGFSRIPVI
GESVDDVVGI AFLKDMVRRE REGGEDGAIA EIMRSPALVP ESKPADDLLR EMQASRTHMA
IVIDEYGGTA GLVTIEDILE EIVGEITDEY DNEVPPVEWI DANTARVTAR LDVDDLAKLF
DFDVDDLPGA DDSLTVGGLL ATALGRVPIP GATVTVGGLR LSAERAAGRR NQIGTVVVQR
LPHPSNGDDG SPPPRSDDGG EPVRSSSSDR KVNS