Gene Francci3_1105 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1105 
Symbol 
ID3905776 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1318991 
End bp1320430 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content70% 
IMG OID637878437 
Producthypothetical protein 
Protein accessionYP_480214 
Protein GI86739814 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0363479 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.305045 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCAGTG TGCAGGACGA ACCAGGACGC GTCGAGGCTC TGGGACGGCT GTGTCGCTTC 
CGGCTGGAGT TCTACGACTG CCTGACCCGC CGAGCGGACG CGCTGTTCGA AACGGCGGAG
GCGGTGCTGT GCACCGACGG CCCGGTCCGG ACGCTGGTCG ACCTGACACT GGCCCCGGAG
CACCGTCGCG GCCATGGAGC CTTGTACGAC GGGCTGAACA GCGGCCGATT AGAGATCGCC
CGGCTGCGAC GTGCGCTCGC GGACCTTCCG CTGCCCGCGG CCGCTGACGG ACGACTCGTG
CTGGCCGTCG ATGTCAGCCC ATGGCTGCGC TCGGACGCCT CGACCAGCGC GGAGCGGCTG
TTCTGCCATG TTCATGGTCG CGCGAAGAAC CAGTCCCAGC TGATTCCTGG CTGGCCGTAC
TCGTTCGTCG CGGCCCTCGA GTCCGGCCGG ACGTCGTGGA CCGCGCTCTT GGACGCAGTT
CGCCTCGGCC CCACCGACGA CGCCACAGCG GTGACCGCCG ACCAGCTCCG GGCGGTCGTG
GGCCGGCTGA TCGCCGCCGG GCACTGGCAC GACGGAGACC CGAACATCCT GATCGTGATG
GACGCCGGGT ACGACGTGAC CCGGCTGGCG TTCGTCCTGG CCGACCTGCC TGTCGAGGTG
CTCGGCCGGA TCCGTTCCGA CCGTGTCCTG CGCCTGGCCA AACCACCGAG ACAGCCGGGT
ACCAACGGCC GTCCGCCCAA GCACGGCCCC GAGTTCGCCC TCGACAGGCC CGCGACTTGG
CCCGAACTGC AGCACACCAC GACCACCAAC ACCAGCCGCT ACGGCACCGC CACCGCGACC
AGCTGGAACC GGCTACACCC CCGGCTCACC CACCGCACCT GCTGGCTCGA CCACCCCGGA
GACCTACCGA TCATCGAAGG GACCCTCATC CGCCTGCAGG TCGACCACCT CCCCGGCGAC
CGCGACCCCA GGCCCGTCTG GCTGTGGTCC TCCGCGGTTG ACGCCACCGC CACCGACATC
GACCGCGCCT GGCAGGCGTT CCTGCGCAGG TTCGACCTGG AACACACCTT CCGACTGTTC
AAACAGACCC TCGGCTGGAC CCGCCCGAAG ATCCGAACCC CGCAGGCCGC GGACCGCTGG
ACCTGGCTGA TCATCACCGT CCACACCCAG CTCCGCCTCG CCCGACCCCT GGCCCGCGAC
CTACGCCGCC CCTGGGAGAA ACCCGCCCCA CCAGGACGAC TCACGCCCGC CCGAGTCCGA
CGAGGATTCC GGAACATCCG CGCGATCATG CCCCTCCCCG CCAGCGCACC GAAACCCACC
AAGGCTGGCC CCGGCCGCCC TCCCGGCTCA CGCAACCGCA GACCCGCACC CCACCACGAC
GTCGGAAAAA CCATCCGACG GGACCTCACC ATGACCGCCC ACCAACACCG CACAGGTTAA
 
Protein sequence
MGSVQDEPGR VEALGRLCRF RLEFYDCLTR RADALFETAE AVLCTDGPVR TLVDLTLAPE 
HRRGHGALYD GLNSGRLEIA RLRRALADLP LPAAADGRLV LAVDVSPWLR SDASTSAERL
FCHVHGRAKN QSQLIPGWPY SFVAALESGR TSWTALLDAV RLGPTDDATA VTADQLRAVV
GRLIAAGHWH DGDPNILIVM DAGYDVTRLA FVLADLPVEV LGRIRSDRVL RLAKPPRQPG
TNGRPPKHGP EFALDRPATW PELQHTTTTN TSRYGTATAT SWNRLHPRLT HRTCWLDHPG
DLPIIEGTLI RLQVDHLPGD RDPRPVWLWS SAVDATATDI DRAWQAFLRR FDLEHTFRLF
KQTLGWTRPK IRTPQAADRW TWLIITVHTQ LRLARPLARD LRRPWEKPAP PGRLTPARVR
RGFRNIRAIM PLPASAPKPT KAGPGRPPGS RNRRPAPHHD VGKTIRRDLT MTAHQHRTG