Gene Francci3_0466 details


Gene Information

Locus tag: Francci3_0466
Symbol:
ID: 3903197
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 545444
End bp: 546793
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 72%
IMG OID: 637877797
Product: tetratricopeptide TPR_2
Protein accession: YP_479581
Protein GI: 86739181
COG category:
COG ID:
TIGRFAM ID:
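
The coordinate and length fields in this table can be cross-checked against each other. The Python sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and the values are copied from the table.

```python
# Values copied from the Gene Information table.
start_bp = 545444
end_bp = 546793
gene_length_bp = 1350
protein_length_aa = 449

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is a stop,
# which does not contribute a residue to the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinates agree with Gene Length
print(expected_protein_aa == protein_length_aa)  # codon count agrees with Protein Length
```

Under these assumptions both comparisons hold: the span is 1350 bp, which is 450 codons, or 449 residues plus one stop codon.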


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.924549
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.198291
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGACGG ATAACGCTCA TGCCTGCACT CATCCGCTTG CGTTCATCCG AGCCCAGCGC 
GGGTGGTCCT ACCAGCGTCT GGCGCGGGTC GTGGCCCGCC GCGCACGTGA CCTCGGGGTG
GCGAACATGG CCGCCGAGCG GCAGAAGGTC TGGCGCTGGG AGCACCGTGG GGTCGTGCCC
GACCGGGTGT CCCAGCTCGC GCTCGCCGCG GAGCTCGGCG TGCCCAACGA TCGCCTGGAG
TCGCACCCCT GGCCCGCGTG GCTACCGACC GGCGACGCGG TGCGCACCGA GTACCCGTGG
ACGCCGGGTG GCAGCATAAC CTCGATCATG GACGTCGTCG AGGACGCGCT CTCCGACCGC
CGCGGCTTCC TGACGATCAC TGGGACCGGG GTGGCGGAAC TCGCGACGCA GTGGCTCGGC
ATGGAACCGG CCCGGCTGGC GGCGGCCCTG AACGGGGGAC GGGTCGACGA CCAGATCGTG
AACCGGATCG AGCACAACAT CCCCGGGCTA CGGGTCATGG ATGAGCGGCT CGGTGGGGAG
AGCGTGCGCC GGCTGGTCGA CGCCGAGCTC GGCGTGGTGG CGGACCTGCT GGCGCGCGGA
TCCTATACCG AGCACGTGGG CCGTCACCTG CATCTGGTGG CCGCGGAGCT CGCCCGGTTC
GCGGGATGGG TCTCGTTCGA CGCCGGCTTC CAGACGGCCG CCCAGCGGTA CTGGATCACC
GCGCTGCATG CCGCGCACGC CGGCGGGGAC CGCATGCTCG GTGCGAACGT GCTGAAGAAC
ATGTCCCTGC AATGCGTGGA CTTCGCCCGC CCACGGGAGG CGGTGGATCT GGCCGAGGCC
GCCGTGGCCA GCGCCGGGGG GGCGTCCGGT CGCGTCGGCG CCATGCTGCA CATGCGGCGG
GCCCGTGCCC ATGCCGCGCT CGGGGAGGCC AGCGCCTGCG CGCAGGCGCT GGCCTGCTCG
GAGGAAGCGA TGGTCACCGC GCGGCCTGAG GAGCCCGCCT GGTCGTCCTA CTTCGACGAG
GCCGAGTACC AGGCGCAGAT CGGCAGCTGC TACATCGATC TCGGTCACCT CGCGCAGGCG
GACCGGTGGC TGGAACGCTC CCTGGCGATC CAGCCGGACT CCCGGGCCCG GGACCGGGCC
ACCTACCTGC TTCGGTGGGC CGCGGTCCAG ATGGATCTCG GTAACGTCGA TCACGGGTGC
GAGCTGACCC GCCAGGCCCT GCCGATGCTG GCGGCGACGC GATCCAAGCG CAACGCCCGC
CGGGCCGACG AGCTGCGCCG CCGGCTGCGG CGGCACGGCA CCGACCCGGC GGTGCGGGAA
CTCGATCAGA TCCTCGCCCG GACTGTCTGA
 
Protein sequence
MLTDNAHACT HPLAFIRAQR GWSYQRLARV VARRARDLGV ANMAAERQKV WRWEHRGVVP 
DRVSQLALAA ELGVPNDRLE SHPWPAWLPT GDAVRTEYPW TPGGSITSIM DVVEDALSDR
RGFLTITGTG VAELATQWLG MEPARLAAAL NGGRVDDQIV NRIEHNIPGL RVMDERLGGE
SVRRLVDAEL GVVADLLARG SYTEHVGRHL HLVAAELARF AGWVSFDAGF QTAAQRYWIT
ALHAAHAGGD RMLGANVLKN MSLQCVDFAR PREAVDLAEA AVASAGGASG RVGAMLHMRR
ARAHAALGEA SACAQALACS EEAMVTARPE EPAWSSYFDE AEYQAQIGSC YIDLGHLAQA
DRWLERSLAI QPDSRARDRA TYLLRWAAVQ MDLGNVDHGC ELTRQALPML AATRSKRNAR
RADELRRRLR RHGTDPAVRE LDQILARTV
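
The protein sequence can be checked against the gene sequence by translating codons with translation table 11, the table given in the Gene Information section. The Python sketch below is a minimal illustration, not the method used to produce this record. The names `CODON_TABLE` and `translate` are hypothetical. The amino-acid string is the standard NCBI codon ordering (bases in T, C, A, G order), which table 11 shares with the standard code for elongation codons. Only the first and last 30 bases of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G at each position).
BASES = "TCAG"
# Amino acids for translation table 11 in that same order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group bases in blocks of ten.
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last 30 bases of the gene sequence, copied from the record.
first_blocks = "ATGCTGACGG ATAACGCTCA TGCCTGCACT"
last_blocks = "CTCGATCAGA TCCTCGCCCG GACTGTCTGA"

print(translate(first_blocks))  # MLTDNAHACT
print(translate(last_blocks))   # LDQILARTV
```

The two outputs match the first ten and last nine residues of the protein sequence. The final codon, TGA, is a stop codon and contributes no residue.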