Gene Francci3_4126 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4126 
Symbol 
ID3907091 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4927773 
End bp4929218 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content72% 
IMG OID637881454 
Productputative replication initiation protein 
Protein accessionYP_483203 
Protein GI86742803 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTTTCCG TGGATGTGGG CGTGAGCCTG CCCGAAAACC GGCCGGTCGC GGGTGGGTGT 
TCCCGCCCGA TCCGCCTGTC CGGCCATATT GATCATGTGG ATGTCGGTAC TGGTGAGGTC
CGGCGGGCGT TTACCAGTGC CGGTGAGCCG GGCGGGGTGC TGCATGTGCG GTGCGGGAAC
CGGCGGGAGT CGGCGTGTCC TGCCTGTTCG GCGGTGTACA AGCGGGACGC GCGGCGGCTG
GTCCTGGCCG GGCTGGCGGG TGGCAAGGGT GTGCCGGCAT CGGTGTCGGA TCATCCGGCG
TGGTTCGTGA CGTTGACCGC GCCGAGCTTC GGGCCGGTGC ACTCACGGCG CCAGCGGGGC
GGGAAGACGG GTCCGGTGCG GGCGTGTCAT CCCCGGCGCG GTCTGTGTCC GCATGGGAAA
CCGGCGGGCT GTCATGAGCG GCACCATGAG GCGGATTCCC GGCTCGGGTC GCCGTTGTGC
GCGGACTGTT ACGCCTACGG CCGGTCGGTG GTCTGGAACG CGCTCGTCCC CCGGTTGTGG
AAGGCCACCC GCGACGGCAC GGAATCGGCG GTGGCGGCGG CAGCCGGGCT GACCGTGGCA
GGGCTGCGCC GCGCGGCGCG GCTGTCGTTC GTGAAAGTCG CGGAGATGCA GGCGCGGGGC
GTCGTCCACC TGCACGTAGT GATCCGGGTG GACGGTCCCG ACGGGCCGGG CTCGGCTCCT
CCGGCCTGGG CGGCCGGTGA GCTGGTCGCA GACGCGCTGC GCGGGGTCCT GAGCGTGGTG
GCAGTACCGG CTCCGGATCC GGACGCGCTC ACCCTTGACA CCACCGCCGT TGCGGATGAC
GGCTGGGCGG TGCGCTGGGG CGCACAGGTC GATATCCGCC GGATCGCGTT GGAGGGGCCG
GCGGATGTGT CCCGGGTCAG TAACTACCTG GCGAAGTACA TCACCAAGTC CGCGGCGGCG
GGTGGTGAGT TGGATCATCC GGTGCGGTCG CTGGCCGCAC TCGGCCGGCT GACGCTGCCC
CCGCATGTGC GCCGGCTGGT GGAGACCTGC TGGCGCCTCG GCCACGACCC GGTATTCACG
ACGGCGTTGG ATGCGACGCT GGGCCGGGAC TCCGGCGACG TCCCCCGGCT GATCCGCTGG
GCCCACACCT TTGGGTTCGG CGGCCACTGG TTGACGAAAA GTCGGGCGTA CTCGACGACG
TTCACCGCGC TGCGGACGGT ACGGCGGGTC TGGTCCCGCA CGATCGGCGC GGCCATGGCA
GGCCGGGCAC CGGTGGATGC GTTCGGCCGA GCTGACGGCG ACCCAAACAC GATCGTCCTG
GGTACGTGGT CCTATGCCGG CCGGGGGCTG CACCTCGGGG GCCATGGTGG GGGCTCGGGT
CCGCCTGATC TGCCAGCGTC GCGGGCGGCT GGGAGCCCGG TGGGCCCCTG GCGGGCCGAA
CGGTGA
 
Protein sequence
MVSVDVGVSL PENRPVAGGC SRPIRLSGHI DHVDVGTGEV RRAFTSAGEP GGVLHVRCGN 
RRESACPACS AVYKRDARRL VLAGLAGGKG VPASVSDHPA WFVTLTAPSF GPVHSRRQRG
GKTGPVRACH PRRGLCPHGK PAGCHERHHE ADSRLGSPLC ADCYAYGRSV VWNALVPRLW
KATRDGTESA VAAAAGLTVA GLRRAARLSF VKVAEMQARG VVHLHVVIRV DGPDGPGSAP
PAWAAGELVA DALRGVLSVV AVPAPDPDAL TLDTTAVADD GWAVRWGAQV DIRRIALEGP
ADVSRVSNYL AKYITKSAAA GGELDHPVRS LAALGRLTLP PHVRRLVETC WRLGHDPVFT
TALDATLGRD SGDVPRLIRW AHTFGFGGHW LTKSRAYSTT FTALRTVRRV WSRTIGAAMA
GRAPVDAFGR ADGDPNTIVL GTWSYAGRGL HLGGHGGGSG PPDLPASRAA GSPVGPWRAE
R