Gene Francci3_1130 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1130 
Symbol 
ID3906609 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1346663 
End bp1347994 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content71% 
IMG OID637878461 
Producthypothetical protein 
Protein accessionYP_480238 
Protein GI86739838 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.823409 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.981294 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCACCCAC CTGCGAAGGC GTCGACGGGG AGGCCGAAGG TGCAAGCTGG TCAGCGAAAC 
GACGCACTCG CCAACCTGGC CGATCTGCTG CGGACACTGC GTGATCAGCA TCGGACCAAC
AACAGCCGGC TCGAAGCCCG CACCGGCTTC AAACGCCAGC AGATCTCTCG GGCCGTCAAC
GGCCAGGAGA TCCCGTCCGC GGACCTCGCG GACGCGCTGG ACACGGCGTT GGCCGCAGCG
GGAGCCATCC GGCGATTGCG GGACGAAGCT GTCAGGGAGA AGCGGGCCCG GGACCTGGGG
CTCGACCCGA GCAGGCAGGA GGAACCCGTG GACGCCAACC GTCGGCAGGC GTTCGAACTC
GCCGCCGCGA GCCTCGTCGC CGCCCAGATG TACCGGGAGT GGACCTCGTC CGCCCCCGAT
GTGCTGACCC TCGACGAGAT CGACGACGCG ATCAACGCGC ACACCGTCGC GTTCACCGTC
GAACCGCACC AGCGGCTCGC GCCGAAGGTG TGGAAGACGT GGAAGTCGGC GCACCACCAC
CTGATGAACG GCAGTGGCCG GGCCCGCCCG CAGACCAGGC TCACCGTCGC CGCCGGCTAC
GCCTCCTACA TGCTGTCGCG GCTGTCGTTC AACCTCGGAG ACACCCTGGC TTCGCGCCGG
TTCATCCGCC TCGCCGAAGA CCACGCGAGC CAGACCGACG ACGTGGTGCT GACCGCGTCG
GTCGGGGAGA TGGTGACGAC GCTGGCGTTC TACGGCCGCC GCTACCAGGA AGCCGCCGTC
TCGGCTCGGA AGACCGCGGT CGTGGCGGAC AACCCGTACA CCCGGGCCCG GATCGCCAGC
TACGAGGCGC GGGCCCTTGG CGCGCTCGGC GACGTCGAGG GCACCCGGGC GGCGTTGAAC
CGGATGCGCA CGTCGGTCAC GGACCTGCCG CTGCAGCCCG GGATCAGCCC GTTCGGCCCG
GCCGCCGCCG AGATGATGTA CGCCGGGGTC CTGACCCGGA TCGGCGGTGG CGTCGAGGCC
GAGCCGATAG CCCGAGCCGC GCTCGCCGCC TACGAAGGAG GCCAGGCGGG CGGGTTCGAG
GACTACGGCC ACGCGCTGCT CGCGCTCGCG GCCAGCCTCA CCGCCCGCGA ACAGCCCGAG
ATCGACGAAG CCGCGACCAT GGCCGGGAAG GTCGTCGACA TGCTCGACAC CCGGCCCACC
GCCTCGGTCT CCGACCGGGT CGCAGAGATC GCCATAGCGT TCACCGGCCA CCCCACCGTC
GAACCCGTCC GTGACTTCTG GGACCGCTGG CAGGCACGCC CCCGCCTCGA ACTGACCACG
GGCCAGGCGT GA
 
Protein sequence
MHPPAKASTG RPKVQAGQRN DALANLADLL RTLRDQHRTN NSRLEARTGF KRQQISRAVN 
GQEIPSADLA DALDTALAAA GAIRRLRDEA VREKRARDLG LDPSRQEEPV DANRRQAFEL
AAASLVAAQM YREWTSSAPD VLTLDEIDDA INAHTVAFTV EPHQRLAPKV WKTWKSAHHH
LMNGSGRARP QTRLTVAAGY ASYMLSRLSF NLGDTLASRR FIRLAEDHAS QTDDVVLTAS
VGEMVTTLAF YGRRYQEAAV SARKTAVVAD NPYTRARIAS YEARALGALG DVEGTRAALN
RMRTSVTDLP LQPGISPFGP AAAEMMYAGV LTRIGGGVEA EPIARAALAA YEGGQAGGFE
DYGHALLALA ASLTAREQPE IDEAATMAGK VVDMLDTRPT ASVSDRVAEI AIAFTGHPTV
EPVRDFWDRW QARPRLELTT GQA