Gene Francci3_3855 details

Gene Information

Locus tag: Francci3_3855
Symbol:
ID: 3905603
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4619021
End bp: 4620100
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 74%
IMG OID: 637881181
Product: succinyl-diaminopimelate desuccinylase
Protein accession: YP_482934
Protein GI: 86742534
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases
TIGRFAM ID: [TIGR01900] succinyl-diaminopimelate desuccinylase
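
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4619021
end_bp = 4620100

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# An unspliced CDS is read in whole codons; the final codon is assumed
# to be the stop codon, which contributes no amino acid.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)
```

Under these assumptions the computed values agree with the listed gene length (1080 bp) and protein length (359 aa).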


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.149878
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCTCG ACCTCACGGC GCCCGTCGGC GAGCTGACCC GGGCGCTGGT TGACGTGCCC 
TCGGTGAGTG GGGACGAGGC CGCCCTCGCG GGCGCGGTGG AGAAGGCGCT CACCGCGGTG
GACGGGCTGC GGGTTGACCG GGACGGTGAC GCCGTGGTGG CCAGGACCGA GCTGGGGCTG
CCCGGCCGGA TCCTGCTTGC CGGCCATCTG GACACCGTTC CGCTGGCGGG CAACCTGCCC
TCGCGTGTGG TGGGCGGGCG GCTCTACGGC TGCGGGACGT CGGACATGAA GGCAGGGGTC
GCGGTCGCGC TGCGGCTCGC GGCCACGCTC CCGGTCGCCA CGCCCGGCGC GATGAGCCAC
GACGTCACCT GGGTCTGCTA TGACCACGAG GAGGTCGAGG CGGCCCGCAA CGGGCTGCGT
CGGCTCGCCG CCCGGCACCG GGACTGGCTG GATGCGGATC TGGCCATCCT GATGGAACCG
ACCTCCGGCG AGATCGAGGC GGGTTGCCAG GGCACCCTGC GGGTGGTCGT GACGCTTCCC
GGCACCCGGG CGCACTCGGC CCGGTCGTGG CTCGGGGACA ACGCCATCCA CAAGGCCGGC
GATCTGCTCC GCCGCCTCGC CGGCTACCGG GCGCGGACGG TGACGCTGGA CGGCTGCACT
TACCGTGAGG GGCTCTGCGC GGTGCGGATC GACGGTGGGG TGGCGGGCAA CGTGATCCCC
GACCGGTGCC AGGTCACGGT GAACTTCCGG TTCGCTCCGG ACCGGGGCCC CGACGAGGCG
GTGGCGCATG TTCGCGAGGT GCTGGGTGGC TACGACGTCG AGGTCACCGA TCTCGTCGGC
GGGGCGCTGC CCGGGCTCGC CGCCCCGCAC GCGGCGGCGT TCGTCGCCGC CACCGGTCGT
GTGCCGGTGG CGAAGTACGG CTGGACGGAC GTGGCGCGCT TCGCCGAGCT CGGGATCCCG
GCGCTCAACT ACGGGCCGGG CGATCCCAAC CTGGCCCATG CCCGGGACGA GTACGTCGAG
CTCGCCGCGA TCGACGAGGC CGAGCGGCTG CTACGGGCGT ACCTGTCCGG TGTCTCCTGA
 
Protein sequence
MSLDLTAPVG ELTRALVDVP SVSGDEAALA GAVEKALTAV DGLRVDRDGD AVVARTELGL 
PGRILLAGHL DTVPLAGNLP SRVVGGRLYG CGTSDMKAGV AVALRLAATL PVATPGAMSH
DVTWVCYDHE EVEAARNGLR RLAARHRDWL DADLAILMEP TSGEIEAGCQ GTLRVVVTLP
GTRAHSARSW LGDNAIHKAG DLLRRLAGYR ARTVTLDGCT YREGLCAVRI DGGVAGNVIP
DRCQVTVNFR FAPDRGPDEA VAHVREVLGG YDVEVTDLVG GALPGLAAPH AAAFVAATGR
VPVAKYGWTD VARFAELGIP ALNYGPGDPN LAHARDEYVE LAAIDEAERL LRAYLSGVS
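
The relationship between the two sequences above can be sketched in a few lines of Python. This is a minimal illustration, not the tool that produced the record: it uses only the first 60 bases of the gene sequence, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons). The function names `translate` and `gc_fraction` are hypothetical helpers introduced here.

```python
# Codon table built in the conventional TCAG order; the amino-acid
# assignments are those shared by the standard code and table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES), AMINO
    )
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate whole codons from the first base; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence shown above (60 bases).
first_line = "ATGAGCCTCG ACCTCACGGC GCCCGTCGGC GAGCTGACCC GGGCGCTGGT TGACGTGCCC"

print(translate(first_line))    # compare with the start of the protein sequence
print(round(gc_fraction(first_line), 2))
```

Applied to the full 1080-base sequence, the same functions should reproduce the protein sequence followed by a terminal `*` for the TGA stop codon, and a GC fraction to compare against the listed GC content.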