Gene Francci3_3399 details


Gene Information

Locus tag: Francci3_3399
Symbol:
ID: 3905981
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4030804
End bp: 4032234
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 70%
IMG OID: 637880721
Product: putative replication initiation protein
Protein accession: YP_482482
Protein GI: 86742082
COG category:
COG ID:
TIGRFAM ID:
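
The length fields above are consistent with each other, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(4030804, 4032234)
print(length_bp)                  # 1431
print(protein_length(length_bp))  # 476
```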


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00332315
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACACGCC CTATCGATAC CGTGACCAAA CCGGGTGAAG GGACGGTGGC CGGGGGCTGC 
ACACACCCCA TCCGCCTCGC TGGCCACACC GATCATGTGG ACATCACGAC CGGGGAGGTA
CGTCGGGCGT TCTCCTCAGC CGGACAGCCG GGCGGGGTGG TACACGTGCG GTGCAACAAC
CGGCGCGCTT CCGTCTGCCC CTCCTGCTCC AAGCTCTACC AGCGAGACGC CCGGCGGATC
GTGCTGTCCG GGCTGGCCGG TGGCACGAGC GTGCCCGACA CGGTCAGCGG TCATCCGGCG
CTGTTCGTGA CGTTGACCGC CCCGAGTTTC GGCGCGGTGC ACTCCCGCCG CAGCAAAGGG
AAGCGGGGCG AGGCGCGGGC GTGCCGGCCT CGGCGGGGCG AATGCCCGCA TGGACGGCCG
GCGGGCTGCC ACGCCCGGCA CCGGGACGGG GATCGGTTGC TCGGAACACC GCTGTGCCCG
GACTGCTTCG ACTATCCGGG CGCGGTTACG TGGAACAGTC TCGCGCCGAT GCTATGGAAG
GTCACCCGGG ATCGAATGGA ATCGGCGGTC GCGGCGGCGG CGGGGTTGAC GGTGGCCGGT
CTGCGCCGCG TCATACGGAT CACCTGCGTG AAGGCAGCGG AAATGCAGGC CCGCGGCCTG
GTCCACCTAC ACGCCGTCAT CCGGATCGAC GGGCGCGGCG AGAACCCGAA TGACCTCGTG
GCACCACCCG CGTGGGCGAC AACAAAGCTT GTCGCCCACT GCCTGCGCAC AGTCCTCGGG
GAAGTGGCCA TACCTGCTCC CGACCCGAAC CACCCCGGCG GCGTGACCCT CGTGCGCTGG
GGTGACCAGC ATGACCTACG CCCCATCACC CTGGACGGAC CCGCCACAAG CGGGAAGATC
GCAAACTACC TCGCCAAGTA CCTCACCAAG AGTGTCACCG CCGGCGGCCT GCTCGACCGG
CCCGTACGTA GCCTCGGGCA TCTCGCCCGC ATCCCCCTCA ACCCCCACGC CCGGCGGATG
GTGGAAACCT GCTGGCAGCT CGGCCAGGAC GAGGACTTCA CCGCGGCACT CGACGAAGCC
ACCGGACGGC AACCCGGCCG TCTCCCCGCT CTGATCCGCT GGTCCCACGC GTTCGGCTGG
GGCGGACACT GGATTTCCAA GAGCCGCCGG TACTCCACCA CCTTCGGCAC ACTGCGCGCC
GCACGCCGCA CCTGGGCACG CACCATCGGC GCCGTCCTGG CAGGCCGTCC GGTCGGCGAC
GCCTTCAGCC GCCCCGACAA CGACCCCCGC ACCACCATCC TCAGCACCTG GCGCTACGCC
GGACGGGGCA CAACCCCCGA CATCGATGCA CACAGCCGGA GCGGACACGG ACCCCCATCC
CACCGCGCGA CCCTAACCCC GACCACCCGC GGCGGAGACG ACGATGCCTG A
 
Protein sequence
MTRPIDTVTK PGEGTVAGGC THPIRLAGHT DHVDITTGEV RRAFSSAGQP GGVVHVRCNN 
RRASVCPSCS KLYQRDARRI VLSGLAGGTS VPDTVSGHPA LFVTLTAPSF GAVHSRRSKG
KRGEARACRP RRGECPHGRP AGCHARHRDG DRLLGTPLCP DCFDYPGAVT WNSLAPMLWK
VTRDRMESAV AAAAGLTVAG LRRVIRITCV KAAEMQARGL VHLHAVIRID GRGENPNDLV
APPAWATTKL VAHCLRTVLG EVAIPAPDPN HPGGVTLVRW GDQHDLRPIT LDGPATSGKI
ANYLAKYLTK SVTAGGLLDR PVRSLGHLAR IPLNPHARRM VETCWQLGQD EDFTAALDEA
TGRQPGRLPA LIRWSHAFGW GGHWISKSRR YSTTFGTLRA ARRTWARTIG AVLAGRPVGD
AFSRPDNDPR TTILSTWRYA GRGTTPDIDA HSRSGHGPPS HRATLTPTTR GGDDDA
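
The gene and protein sequences above can be cross-checked against each other with a short script. The sketch below is illustrative only: it assumes the input is a complete CDS made of unambiguous A/C/G/T bases, uses the standard amino acid assignments shared by translation table 11, and reads an alternative start codon (such as the GTG that opens this gene) as methionine. The helper names are hypothetical.

```python
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Start codons listed for translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(text: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Assumes the first codon is the true start, so any table 11 start
    codon in that position is emitted as M.
    """
    seq = clean_sequence(text)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residue = "M"
        else:
            residue = CODON_TABLE[codon]  # raises KeyError on ambiguous bases
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence shown above.
first_line = "GTGACACGCC CTATCGATAC CGTGACCAAA CCGGGTGAAG GGACGGTGGC CGGGGGCTGC"
print(translate(first_line))  # MTRPIDTVTKPGEGTVAGGC
```

Passing the full gene sequence to `translate` and `gc_fraction` allows the result to be compared with the listed protein sequence and the reported GC content.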