Gene PCC7424_5207 details


Gene Information

Locus tag: PCC7424_5207
Symbol:
ID: 7109539
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 7424
Kingdom: Bacteria
Replicon accession: NC_011729
Strand:
Start bp: 5780619
End bp: 5781857
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 42%
IMG OID: 643483414
Product: von Willebrand factor type A
Protein accession: YP_002380423
Protein GI: 218442094
COG category: [R] General function prediction only
COG ID: [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain
TIGRFAM ID:
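
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and that the last codon of the CDS is a stop codon; the function names are illustrative and are not part of the source record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(5780619, 5781857)
print(length_bp)                  # 1239
print(protein_length(length_bp))  # 412
```

Both values agree with the Gene Length and Protein Length fields of this record.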


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.0000000146018
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGAGTTC AGATAAGACC CGCAATAAGC GATCGCCATG TAGACGCTAA CCAAACTAAC 
ACTCAACGTC AATTTTCCTT AGCTATTTGT GCAACTGGAG ATCAAGATAA AACGCTACCT
TTAAATTTAT GTCTGGTTCT CGATCACAGT GGATCAATGG CAGGAAAACC CCTAGAAACG
GTAAAACAGG CAGCTATTGA ACTTGTTAAA CAATTAAATG TAGAAGATCG CCTCTCTATT
ATTGCTTTTG ATCACCGAGC TAAAGTTCTT GTTCCCAATC AAGGCATAGA CAACCTCAAC
ACTATTATTG AACAAATAAA CTCCCTTAAA CCAGCCGGCG GAACAGCGAT CGATGAAGGA
TTAAAATTAG GAATTCAAGA GTCAGCGAAT GGAAAAAAAG ACCGGGTTTC CCAAATATTC
TTATTAACCG ACGGGGAAAA TGAACATGGA GATAATGAAC GCTGTTTAAA ACTGGCTCAT
GTCGCCTCAG ATTATAATAT CACTCTCAAT ACGTTAGGCT TTGGTAATCA TTGGAATCAA
GATGTTTTAG AAAAAATTTC TGACTCTGCC GGTGGCACTC TATGCTATAT AGAAACCCCT
GATAAGGCAA TAGAAGAATT TAGCAGACTC TTTAACCGCG CTCAGTCGAT CGGGTTAACT
AATGCCCATC TTATCATTGA TTTAATGCCT CAAGCTCGTC TAGCAGAACT TAAACCCATT
GCCCAAGTTG AACCCGAAAC GGTAGAATTA ACCGTACAAT CGGAAGGCGA TCGCTACAGT
GTCCGCTTAG GAGATTTAAT GATAGACCAA GAACGAGTCA TTTTAATCAA TCTTTATCTG
AGTCAACTGT CTCCGGGACT TCAAACCATC GGTAAAGCAC AAGTCCGTTT TGACGATCCG
GCTTTATCTC AAACCGGCAT CCTTTCCGAA GCGATACCCC TAACCTTAGA AGTGCAGAAG
GTTTATCAAC CCCAACCGAA TGATCAGGTT CGTCAATCTA TTTTAACCTT AGCAAAATAC
CGTCAAACTC AAATTGCTGA AGAAAAACTT AAAGCCGGCG ATCGTCAGGG AGCAGCAACT
TTATTACAAA CTGCCGCCAA AACTGCTTTA CAATTAGGAG ACCAAGGAGC AGCAACTATC
TTACAAACTA GCGCGACTCG CTTACAAATG GGGGAAGACC TGTCAGAAGC CGATCGCAAA
AAAACCCGCA TTGTCGCAAA AACAAGATTA GTAGAGTAG
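
The gene sequence above is printed in space-separated 10-base blocks, so it has to be cleaned before any computation. The following is a minimal sketch of that cleanup and of a GC-fraction calculation; the helper names are hypothetical, and the record does not state how the database rounds its GC content value.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGAGAGTTC AGATAAGACC CGCAATAAGC GATCGCCATG TAGACGCTAA CCAAACTAAC"
seq = clean_sequence(first_line)
print(len(seq))
print(round(gc_fraction(seq), 2))
```

Passing the full 1239 bp sequence through the same two functions gives a value that can be compared with the GC content field of the record.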
 
Protein sequence
MRVQIRPAIS DRHVDANQTN TQRQFSLAIC ATGDQDKTLP LNLCLVLDHS GSMAGKPLET 
VKQAAIELVK QLNVEDRLSI IAFDHRAKVL VPNQGIDNLN TIIEQINSLK PAGGTAIDEG
LKLGIQESAN GKKDRVSQIF LLTDGENEHG DNERCLKLAH VASDYNITLN TLGFGNHWNQ
DVLEKISDSA GGTLCYIETP DKAIEEFSRL FNRAQSIGLT NAHLIIDLMP QARLAELKPI
AQVEPETVEL TVQSEGDRYS VRLGDLMIDQ ERVILINLYL SQLSPGLQTI GKAQVRFDDP
ALSQTGILSE AIPLTLEVQK VYQPQPNDQV RQSILTLAKY RQTQIAEEKL KAGDRQGAAT
LLQTAAKTAL QLGDQGAATI LQTSATRLQM GEDLSEADRK KTRIVAKTRL VE
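
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below assumes the gene sequence is given in the coding orientation and contains only A, C, G, and T. It uses the amino acid assignments of NCBI translation table 11, which match the standard code; table 11 also permits alternative start codons, which this sketch does not handle because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Translation table 11 shares
# these assignments with the standard code and differs in its start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAGAGTTC AGATAAGACC CGCAATAAGC"))  # MRVQIRPAIS
```

The output matches the first ten residues of the protein sequence, and translating the final nine bases (GTAGAGTAG) yields VE, the last two residues before the stop codon.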