Gene Francci3_4122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4122 
Symbol 
ID3907087 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4924560 
End bp4925798 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content67% 
IMG OID637881450 
Producttransposase IS116/IS110/IS902 
Protein accessionYP_483199 
Protein GI86742799 
COG category[L] Replication, recombination and repair 
COG ID[COG3547] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGTGT TATTCCCACG GTGCGCGGGC CTTGACGTGC ACCGGGACAC GGTCGCCGCG 
GCGGTGCGCA TCCAAACCGG TTCGGGTAAA GCCGTCACCG AGGTCCGGAC GTTCACCACG
ACCGGGGGCT CGCTCGGACT ACTCGCGGAC TGGCTCACCG AGTGCCGGGT GACGATCGTC
GGGATGGAAT CGACAGGCGT CTACTGGAAG CCGGTGTTCC ACCTGCTGGA AGACCGGTTC
GAGTGCTGGC TCCTCAACGC CACCCACGTC CGCAACGTAC CAGGCCGAAA AACAGACGTC
GCGGACGCGG CGTGGATCTC GGACCTCGTC GCGCACGGCC TGGTACGCGC CTCGTTCGTG
CCGCCGAAAC CCCAGCGGGA CCTGCGTGAC CTGACCCGGG CCCGGCGGAT CGTGGTCGAG
GAGAAGACCC GGGAGATCCA GCGGCTGGAG AAACTGATGC AGGACGCCGG CGTGAAACTC
ACCAGCGTCG CCTCCAAGCT ACTCGGGGTC TCGGGCCGTG CGATCCTGGA GAAGATGATC
GAGGGAGAGC AGTCCCTGGA ATATCTCGCT GATCAGGCCC GTGGCCGACT CCGCAGCAAG
ATCCCACAGT TGCAGGAGGC ACGCGCGGGA ACGTTCCGCT CCGGGCATCA CGGGTTCCTC
GCCGCGCAGC TCCTGGCCCG GATCGACCTG TGTGACGAGC AGATCGACGA GCTCGACCAC
CGGATCGAGG TGATGATCGC CCCTTTTCGG GAGACGGTCG ACCGGATCCG CACGATCACC
GGGGTCGGTG AGGTCACCGC GACCGTGCTG CTCGCCGAGG TCGGCCTGGA CATGAGCCGG
TTCCCCACCG CCGGCCATCT CGCGTCCTGG GCGGGTATCT GTCCGGGGAA CAACACCTCG
GGAGGGAAAC GCCTGTCCGG GCGGACCCGA CACGGTAACA AGTGGTTACG TACCGCGTTG
ACCGAGGCCG CGCACGCCGC CGCCCGGAGC AAGGACACCT ACCTGGCGTC CCACCACGCC
CAGGTCCGTG GCCGCCGCGG TGTCCTGAAA GCGATCGGCG CGACCCGCCA CGACATTCTC
ATCGCCTACT GGCACATCAT CGCGAACAAG ACCGTCTACC AGGACCTCGG CGGAGACTGG
CATGCCCGCC GACGCCGGGA CCCTGAACGC CGCCGGAAGA ACCTCGTCGG CGAACTGGAG
AAACTCGGCT ACACCGTCAC CATCACACCA GCGGCATAG
 
Protein sequence
MQVLFPRCAG LDVHRDTVAA AVRIQTGSGK AVTEVRTFTT TGGSLGLLAD WLTECRVTIV 
GMESTGVYWK PVFHLLEDRF ECWLLNATHV RNVPGRKTDV ADAAWISDLV AHGLVRASFV
PPKPQRDLRD LTRARRIVVE EKTREIQRLE KLMQDAGVKL TSVASKLLGV SGRAILEKMI
EGEQSLEYLA DQARGRLRSK IPQLQEARAG TFRSGHHGFL AAQLLARIDL CDEQIDELDH
RIEVMIAPFR ETVDRIRTIT GVGEVTATVL LAEVGLDMSR FPTAGHLASW AGICPGNNTS
GGKRLSGRTR HGNKWLRTAL TEAAHAAARS KDTYLASHHA QVRGRRGVLK AIGATRHDIL
IAYWHIIANK TVYQDLGGDW HARRRRDPER RRKNLVGELE KLGYTVTITP AA