Gene Francci3_0130 details

Gene Information

Locus tag: Francci3_0130
Symbol:
ID: 3903407
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 160042
End bp: 161406
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 71%
IMG OID: 637877464
Product: transposase IS116/IS110/IS902
Protein accession: YP_479253
Protein GI: 86738853
COG category:
COG ID:
TIGRFAM ID:
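
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is only an illustration, not part of the database record: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(160042, 161406)
print(length)                      # 1365
print(protein_length_aa(length))   # 454
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.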


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGTGC TGATCGACCG GGTCGCCGGG TTGGACGTGC ACCGGGACAC GGTCGTGGCG 
GCGGTCCGGG TTGGCGGGCG TGGTGGCGGC CGGCGGGGGG AGGTGCGGAC CTTCGCCACG
ACGGGAGCGG GACTGACCCG GCTGGCCGGG TGGCTGTCGG AACAGCGGGT TTCTCTGGTG
GGTATGGAAT CCACGCCTGA CTACTGGCGC CCGTTCTACT ACCTGCTCGA GGCCCGCGGT
CTCACGGTGT GGCTGGTTAA CGCCCGGGAC GTCAAGAACG TCCCGGGAAG ACCCAAAACG
GACAAACTCG ACGCGATCTG GCTGGCCAAA CTCAACGAGC GTGGCATGCT ACGGCCCTCC
TTCGTGCCGC CACCCGAGAT CCGCGAGATC CGTAACCTCA CCCGGCTGCG CCTGGACCTG
ACCGCCGAGT GCACCCGGCA CCGGCTGCGG GTCGAGAAGC TCCTCGAGGA CGCCCTGATC
AAACTGTCGA CGGTGCTGTC GGACATCTTC GGGGTCTCCG GTCGGGCGAT GCTCGACGCG
CTCGTCGCGG GCGAACGTGA TCCGAAGAAG CTCGCGGCGC TTGCCCGCGG CCGGGTCAAG
GCCACCCAGG CCGAGCTGGC GACCGCGCTG ACCGGCCAGT TCACCGAGCA CCACGGTTAC
CTGCTCTCGG TGCTGCTCGC CCAGATCGAC GGCCTCGATC GGCGGATCGC CGAGCTCACC
GAGCGGATCG ACACCGCGAT CGCCGCCCTG CCTGCCCCGG CCCACGCCGC CGCCGACGCC
GCCCGCGGTG GCGAGACCGG CCCCGACGGG GACGCCACCA CCGGGACCGG CCAGGGCGGC
GGTGGCGCCG CGGCCCGGCC CGGCCTGGAC ATCCTCGACC GGCTCGACGA GATCCCCGGC
ATCGCCCGCC ACGCCGCCCA GGTGATCATC GCCGAGATCG GGACCGACAT GGCCCAGTTC
CCGACCTCCG GCCACCTGAA CTCGTGGGCG AAACTGACCC CCCAGACGAT CCAGTCCGGC
GCGAAAAGCC GCACCGGGCG CACCGGCAAG GGCAACCCCT ACCTGCGCGG CGCCCTCGGG
GAGGCCGCCA TGGCGGCGGC GAAGACGAAG ACCTTCCTCG GCTCCCGCTA CCGGCGCCTC
GTCAAACGCC GCGGCCATCT CAAGGCCCTC GTCGCCGTCG CCCGCTCCAT CCTGACCATC
GTCTGGCATC TGCTGAACGA CCCCACCGCG CGGTTCCATG ACCTCGGAGT CGACTACCAC
GCCAGCCTGC AGAGCAGGGA ACGCCGCAAG CGCAACGCCC TGCGCGAGCT CAAGAGCCTC
AACCTGAGCG CACAGGAGAT CACCGCCCTG CTCGCCGCGG CCTGA
 
Protein sequence
MDVLIDRVAG LDVHRDTVVA AVRVGGRGGG RRGEVRTFAT TGAGLTRLAG WLSEQRVSLV 
GMESTPDYWR PFYYLLEARG LTVWLVNARD VKNVPGRPKT DKLDAIWLAK LNERGMLRPS
FVPPPEIREI RNLTRLRLDL TAECTRHRLR VEKLLEDALI KLSTVLSDIF GVSGRAMLDA
LVAGERDPKK LAALARGRVK ATQAELATAL TGQFTEHHGY LLSVLLAQID GLDRRIAELT
ERIDTAIAAL PAPAHAAADA ARGGETGPDG DATTGTGQGG GGAAARPGLD ILDRLDEIPG
IARHAAQVII AEIGTDMAQF PTSGHLNSWA KLTPQTIQSG AKSRTGRTGK GNPYLRGALG
EAAMAAAKTK TFLGSRYRRL VKRRGHLKAL VAVARSILTI VWHLLNDPTA RFHDLGVDYH
ASLQSRERRK RNALRELKSL NLSAQEITAL LAAA
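
A GC content figure like the one in the gene information can be recomputed from a nucleotide sequence printed in space-separated 10-base blocks. The following is a minimal sketch under the assumption that GC content means the fraction of G and C bases over all bases; the helper name `gc_content` is hypothetical, and the example uses only the first 10-base block of the gene sequence rather than the full gene.

```python
def gc_content(seq: str) -> float:
    """Fraction of G/C bases in a sequence; whitespace and line breaks are ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


first_block = "ATGGATGTGC"  # first 10 bases of the gene sequence
print(f"{gc_content(first_block):.0%}")  # 50%
```

Passing the complete gene sequence, blocks and line breaks included, to the same function gives the value to compare against the reported GC content.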