Gene Francci3_2745 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2745 
Symbol 
ID3906456 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3233966 
End bp3235897 
Gene Length1932 bp 
Protein Length643 aa 
Translation table11 
GC content71% 
IMG OID637880068 
Producthypothetical protein 
Protein accessionYP_481834 
Protein GI86741434 
COG category[S] Function unknown 
COG ID[COG2898] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.925989 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGTGC TGCCCCCCTG GAAACAGCCG TGGGCGCCTC GGATCGCCGC GCTGCTGGCT 
CTGGTTGTCG GCCTGGTGGA CGTCATCTCC GCGGTCACCC CGGAGTGGCG CTCCCGCCTG
GAGGACCTGC GGGTGCTCCT GCCCCCCGCG GCCTCCCAGC AGGCCGCCGC GTTCACCGTC
GTCGTCGGCA TGCTGCTCGT GCTGCTGACG CCGGGGCTGC GCCGCCGCAA GCGGCGGGCT
TGGCGGGCCG TGGTGGTCCT GCTCGGCGTG AGCATCGTGC TCCACGTTGC CAAGGGTCTG
GACTACGAGG AGGCGGCCGG ATCGGCCGCG CTGCTCATCG CCCTGCTGCT GGTCCACGGG
GAGTTCCGGG CCAAGGGCGA CCCGTCGACG CGTTGGCGGG CGCTCGGCGT CGGCCTGCTG
CTCACTGTCG TCTCGATCGG GATCGGGTTC CTGCTGCTTT ATGTGCGCCA GGACCGGATC
GTCGGCCCGC ATTCGCTGTC CGCCCAGTTC GAACAGATCG TCGAGGGACT CGTCGGGATC
CCCGGTCCGC TACGGTTCAC CTCCGACCGG TTCTCCGGCC TCGCCGCCCG CGTGCTGCTG
ACGATGGGCC TGTTGACCAT AGTGACCACC GGCTACCTCG CGCTGCGACC GCCCGAGCCC
CGGCCCCGGC TGACCGACTC CGACGAGGAA CGGATACGTG CCCTGCTGGC CCGGCACGGC
AAGGCCGACT CGCTGGGATA CTTCGCGCTG CGGTCGGACA AGTCGGTGAT CTGGTCACCG
ACCGGCAAGG CCTGCGTGGC CTACCGGGTG GTCTCCGGCG TGATGCTGGC CAGCGGAGAC
CCGCTCGGTG ACCGGGAGGC CTGGCCGGGG GCCATCAGGG AGTTCCTGCG TGAGGCGGCC
GATCATGCGT GGACGCCGGC GGTCCTCGGC TGCTCGGAGG CGGGTGGCTT CGCCTGGACC
CGCGCCGGGC TGTGCGCGCT GGAATTCGGT GACGAGGCCA TCGTCGACAC GGCGTCGTTC
ACGCTGGAGG GCCGGGCGAT GCGTAACGTT CGCCAGGCGG TCGCCCGGGT GGAACGCGCC
GGCTACACCG CGGTCGCGCG GCGGGTCGGC GATCTCGCGC CGGCCGACAT CGCCCGGCTG
AAGGCACAGG CCGCTGCCTG GCGGGGTACC CAGACGGAAC GGGGCTTCTC GATGGCGCTC
GGCCGCCTCG GCGGCGGCGC GGACGGCGAC TGCGTAGCCG TGATGGCGTT CTCCCATGAC
GCGGGTCCCC ATGACGCGGG TCCCCACGAC GCGGACGACC CGGTGAACGG TGCTCCGGGC
CACCCGGCGA ACGACACGAC AAGCGGCATC GAGGACGGCA CGTCGCATGC CGACGCGGCC
GGCACCGAGC CCCGGCTGCG CGCCCTGCTG CATTTCGTGC CGTGGGGCCC CAACGGCCTG
TCGCTGGACG CGATGACGCG GGACCGGACC GCCGACAACG GGCTGAACGA GTTCCTCATC
GTCAGCGCCC TGCGCCAGGC CCGCGAGCTG GGCGTCGAAC GGCTGTCGCT GAACTTTGCG
TTCTTCCGGT CCGCGCTCGA ACGCGGCGAG CGCCTGGGCG CCGGGCCGGT CATCCGCTGC
TGGCGTTCCC TGTTAATATT CTTGTCCCGC TGGTTCCAAA TCGATAGCTT GTACCGGTTC
AACGCCAAAT TTCAACCCAC CTGGCAGCCC CGTTATATCT GCTACCCCGC CAGTTCCGAG
CTACCACGGA TCGCCTTGGC GATGCTCGAA GCCGAGGCCT TCCTGGTCTG GCCCTGCTGG
CGCGACCATC TGTCCGGGCT ATCCCGGTTG TCCCGACTAT CCCGGTTGTC CCGACTGCCC
CGGCCTGGGA CGGCCGGCCT CCATCACCGT GCCCGAAGAA AGGACACCTC GACCGGCCCC
GGGCCGGACT GA
 
Protein sequence
MPVLPPWKQP WAPRIAALLA LVVGLVDVIS AVTPEWRSRL EDLRVLLPPA ASQQAAAFTV 
VVGMLLVLLT PGLRRRKRRA WRAVVVLLGV SIVLHVAKGL DYEEAAGSAA LLIALLLVHG
EFRAKGDPST RWRALGVGLL LTVVSIGIGF LLLYVRQDRI VGPHSLSAQF EQIVEGLVGI
PGPLRFTSDR FSGLAARVLL TMGLLTIVTT GYLALRPPEP RPRLTDSDEE RIRALLARHG
KADSLGYFAL RSDKSVIWSP TGKACVAYRV VSGVMLASGD PLGDREAWPG AIREFLREAA
DHAWTPAVLG CSEAGGFAWT RAGLCALEFG DEAIVDTASF TLEGRAMRNV RQAVARVERA
GYTAVARRVG DLAPADIARL KAQAAAWRGT QTERGFSMAL GRLGGGADGD CVAVMAFSHD
AGPHDAGPHD ADDPVNGAPG HPANDTTSGI EDGTSHADAA GTEPRLRALL HFVPWGPNGL
SLDAMTRDRT ADNGLNEFLI VSALRQAREL GVERLSLNFA FFRSALERGE RLGAGPVIRC
WRSLLIFLSR WFQIDSLYRF NAKFQPTWQP RYICYPASSE LPRIALAMLE AEAFLVWPCW
RDHLSGLSRL SRLSRLSRLP RPGTAGLHHR ARRKDTSTGP GPD